Network recording and speech analytics system and method

ABSTRACT

A system and method for network recording and speech analytics wherein a recording system receives media exchanged between first and second communication devices during a telephony call. The media is received by the recording system over a wide area network. The recording system bridges a media path between the first and second communication devices, and replicates media exchanged in the media path for storing the replicated media in a mass storage device. The recording system further captures metadata associated with the call, and stores the captured metadata in association with the stored media. The stored media and metadata may then be provided to a requesting device over the wide area network. The recording system may also be configured to analyze the call recording along with the associated metadata for detecting key words or phrases and/or triggering actionable events.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims the benefit of U.S. Provisional Application No.61/801,267, filed Mar. 15, 2013, the content of which is incorporatedherein by reference.

This application is also related to U.S. patent application Ser. No.14/015,983, filed on Aug. 30, 2013, entitled “System and Method forEncrypting and Recording Media for a Contact Center”, U.S. patentapplication Ser. No. 14/015,974 filed on Aug. 30, 2013, entitled “Systemand Method for Geo-Location Based Media Recording for a Contact Center”,U.S. patent application Ser. No. 14/015,960, filed on Aug. 30, 2013,entitled “System and Method for Handling Call Recording Failures for aContact Center”, and U.S. patent application Ser. No. 14.015,649, filedon Aug. 30, 2013, entitled “Call Event Tagging and Call RecordingStitching for Contact Center Call Recordings”, the content of all ofwhich are incorporated herein by reference.

BACKGROUND

There is a need in the art for a system and method for recording callsbetween a caller and callee. It is desirable for such calls to berecorded by devices hosted in a remote network system instead of deviseshosted at the premise of the caller or callee. There is also a need foranalyzing the recordings for detecting key words or phrases to takeaction based on the detection.

SUMMARY

Embodiments of the present invention are directed to a system and methodfor network recording comprising: receiving by a recording system havinga processor, memory, and mass storage device, media exchanged betweenfirst and second communication devices during a telephony call, themedia being received by the recording system over a wide area network;bridging by the recording system a media path between the first andsecond communication devices; replicating by the recording system mediaexchanged in the media path for storing the replicated media in the massstorage device; capturing by the recording system metadata associatedwith the call; storing the captured metadata in association with thestored media; and providing the stored media and metadata to arequesting device over the wide area network.

The metadata may include data from interactions with a customer of thefirst communication device, wherein the interactions are conducted overmedia channels other than a telephony channel. The metadata may includeinformation on a user of the first or second communication device. Themetadata may include information on the first or second communicationdevice. The metadata may include a word uttered during the call.

According to one embodiment, the recording system analyzes the storedmedia for detecting at least one of a key word or phrase.

According to one embodiment, the recording system encrypts thereplicated media via a first cryptographic key for storing the encryptedmedia in the storage device; and encrypts the first cryptographic keyvia a second cryptographic key for storing the encrypted firstcryptographic key as metadata for the encrypted media.

According to one embodiment, the processor is a media controller. A callcontroller in a first geographic region is configured to: establish thetelephony call between the first and second communication devices;identify a second geographic location associated with a resourceinvolved in the telephony call; and identify the media controller basedon location of the media controller in the identified second geographiclocation.

According to one embodiment, the processor is a first media controller.A call controller configured to: identify the first media controllercurrently assigned to the telephony call; detect failure of the firstmedia controller during the telephony call, wherein the failure of themedia controller tears down the media path; in response to detecting thefailure, bridge a second media path between the first and secondcommunication devices until a second media controller is identified; andin response to the second media controller being identified, signal thesecond media controller to bridge and record media exchanged during thetelephony call.

According to one embodiment, the metadata includes a link to a recordingof media exchanged during the telephony call. The method furtherincludes receiving by the recording system a call event associated withthe telephony call, the call event including a timestamp of when theevent occurred during the telephony call; storing by the recordingsystem the call event in a database record; retrieving by the recordingsystem the database record for displaying the call event on a displaydevice; receiving by the recording system a user command identifying thecall event in response to the display on the display device; andretrieving by the recording system a portion of the recording associatedwith the call event in response to the user command for providing anaudible rendering of the retrieved portion of the recording.

According to one embodiment, the present invention is also directed to asystem for recording media for a contact center. The system includes afirst processor and a first memory. The first memory has stored thereoninstructions that, when executed by the first processor, cause the firstprocessor to: receive media exchanged between first and secondcommunication devices during a telephony call, the media being receivedby the recording system over a wide area network; bridge a media pathbetween the first and second communication devices; and replicate mediaexchanged in the media path for storing the replicated media in the massstorage device. The system also includes a second processor coupled tothe first processor, and a second memory. The second memory has storedthereon instructions that, when executed by the second processor, causethe second processor to: capture metadata associated with the call; andstore the captured metadata in association with the stored media.

A person of skill in the art should recognize that the call recordingsystem according to various embodiments of the present invention allow abusiness to host call recording and speech analytics in a remote networkor “cloud” environment. There are several benefits to this solutionincluding: utility billing; zero footprint of equipment; and whole callrecordings (recording from call inception to call completion, which caninclude all interactions with an interactive voice response system,queue announcements, agent conversations, and supervisor conversations).Embodiments of the present invention also allow for quick and effectiverecording and analytics without replacing a business' PBX (privatebranch exchange) and/or ACD (automatic call distributor) solution.

These and other features, aspects and advantages of the presentinvention will be more fully understood when considered with respect tothe following detailed description, appended claims, and accompanyingdrawings. Of course, the actual scope of the invention is defined by theappended claims.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a schematic block diagram of a system for providing contactcenter services in a hybrid operations environment according to oneembodiment of the invention;

FIG. 2 is a schematic block diagram of a system for providing customerself-service in a hybrid operations environment according to oneembodiment of the invention;

FIG. 3 is a schematic block diagram of a system for providing outboundnotifications in a hybrid operations environment according to oneembodiment of the invention;

FIG. 4 is a schematic block diagram of a system for providing callparking services in a hybrid operations environment according to oneembodiment of the invention;

FIG. 5 is a schematic block diagram of a system for providing callprogress detection for outbound calls made in a hybrid operationsenvironment according to one embodiment of the invention;

FIG. 6 is a schematic block diagram of a system for call recording in ahybrid operations environment according to one embodiment of theinvention;

FIG. 7 is a signaling flow diagram for recording a call in a hybridoperations environment according to one embodiment of the invention;

FIG. 8 is a schematic block diagram of a hybrid operations environmentwith failover capabilities according to one embodiment of the invention;

FIG. 9 is a schematic block diagram of a hybrid operations environmentwith failover capabilities according to one embodiment of the invention;

FIG. 10 is a schematic layout diagram of distribution of various mediaservices in a hybrid operations environment according to one embodimentof the invention;

FIG. 11A is a schematic block diagram of a contact center systemillustrating cost and latency for an typical VoIP call without callrecording according to one embodiment of the invention;

FIG. 11B is a schematic block diagram of the contact center system ofFIG. 11A, illustrating cost and latency involved for the call betweenthe customer and agent, but with call recording enabled;

FIG. 11C is a schematic block diagram of a contact center systemconfigured for geo-location based call recording according to oneembodiment of the invention;

FIG. 12A is a schematic block diagram of a system for contact centercall recording and recording posting according to one embodiment of theinvention;

FIG. 12B is a schematic block diagram of a system for contact centercall recording and recording posting according to another embodiment ofthe invention;

FIG. 12C is a conceptual layout diagram of various componentsinteracting with a key management server for allowing encryption anddecryption of call recordings according to one embodiment of theinvention;

FIG. 12D is a more detailed block diagram of a recording user interfaceaccording to one embodiment of the invention;

FIG. 13 is signaling flow diagram for posting a recorded call accordingto one embodiment of the invention;

FIGS. 14A-14B are signaling flow diagrams for handling failure of amedia control platform during a recording according to one embodiment ofthe invention;

FIG. 15 is a conceptual layout diagram of process for recovering arecording upon failure and recovery of a media control platformaccording to one embodiment of the invention;

FIG. 16 is a diagram of a structure of call recording metadata providedto a web server according to one embodiment of the invention;

FIG. 17 is a diagram of a structure of call recording metadata providedto a web server according to one embodiment of the invention;

FIG. 18 is a conceptual layout diagram of a call record displayed by aclient playback application according to one embodiment of theinvention;

FIGS. 19 and 20 are diagrams of the structure of call recording metadatagenerated for different segments of a call according to one embodimentof the invention;

FIG. 21 is a schematic block diagram of a hybrid operations environmentfor providing media services according to one embodiment of theinvention;

FIG. 22 is a schematic block diagram of a hybrid operations environmentfor providing media services according to another embodiment of theinvention;

FIG. 23 is a schematic block diagram of a hybrid operations environmentfor providing media services according to another embodiment of theinvention; and

FIG. 24 is a schematic block diagram of a network recording and speechanalytics system according to one embodiment of the invention.

DETAILED DESCRIPTION

In general terms, embodiments of the present invention are directed to asystem and method for providing contact center services in a hybridoperations environment where some of the services are provided viasoftware and hardware resources in one operations environment whileother services are provided via software and hardware resources inanother operations environment. The operations environments may bedifferent due to a difference in their locations (e.g. local vs.remote), a difference in the entities controlling the resources in thetwo environments (e.g. different business enterprises), and/or the like.The environments used as examples for describing various embodiments ofthe invention are an operations environment at a contact center premise(also referred to as a local operations or computing environment), andan operations environment at a remote location (referred to as a remoteoperations or computing environment), although the invention is notlimited thereto. That is, a person of skill in the art should recognizethat the embodiments of the invention may extend to any two different orseparate operations environments conventional in the art.

In providing contact center services to customers of an enterprise, thesoftware and hardware resources of the contact center servicing theenterprise are invoked to engage in interactions with the customers. Theservices may vary depending on the type of contact center, and may rangefrom customer service to help desk, emergency response, telemarketing,order taking, and the like. The interactions that may ensue in order torender the services may include, for example, voice/telephony calls,emails, text messages, social media interactions, and the like.

According to embodiments of the present invention, control or influenceover an interaction is provided and retained in whole or in part by anappliance at the contact center premise while media is provided byresources in the remote operations environment. According to someembodiments, control or influence over an interaction is provided andretained in whole or in part by a resource in the remote operationsenvironment, while media is provided by resources in the localoperations environment. In further embodiments, a resource controllingan interaction may invoke media in one operations environment (e.g.local environment) for certain aspects of the interaction, and theninvoke media in a different operations environment (e.g. remoteenvironment) for other aspects of the interaction.

Unlike a traditional hybrid operations environment where control of aninteraction and media for the interaction are either in one operationsenvironment or another, embodiments of the present invention allow bothenvironments to be actively involved in the processing of theinteraction at the same time by, for example, providing control from oneenvironment and media from another environment.

FIG. 1 is a schematic block diagram of a system for providing contactcenter services in a hybrid operations environment according to oneembodiment of the invention. The system includes premise appliances 10at a contact center premise 12, and a remote platform 14 in a remoteoperations environment 16. Both the premise appliances 10 and the remoteplatform 14 include software, hardware, and network infrastructure thatmake up different contact center components for providing contact centerservices to a customer having access to an end user device 18. Exemplarycontact center components include, without limitation, a switch and/ormedia gateway, telephony server, Session Initiation Protocol (SIP)server, routing server, media server, recording server, outbound callserver, statistics server, reporting server, web server, configurationserver, and/or the like. Each server may include a processor and memorystoring instructions which, when executed by the processor, allow afunction of the server to be performed. The various servers may also bereferred to as controllers and may be implemented via an architectureother than a client-server architecture.

According to one embodiment, the contact center components aredistributed between the premise 12 and the remote operations environment16. In this regard, a particular contact center component may beprovided by either the premise 12 as part of the premise appliances 10,or by the remote operations environment 16 via the remote platform 14.In some embodiments, a particular contact center component may beprovided by both the premise 12 and the remote operations environment16. In this regard, logic in either the premise or in the remoteoperations environment may determine, dynamically (e.g. upon arrival ofa call) which component to invoke.

According to one embodiment, the remote operations environment 16 is acloud operations environment that utilizes servers and other types ofcontrollers, and is coupled to premises contact centers over a wide areanetwork. Contact center services from the remote operations environmentmay be provided by a cloud service provider on behalf of multiplecontact centers (also referred to as tenants) as a software as a service(SaaS), over the wide area network. The tenants may own their owninfrastructure for providing some of the contact center services. Theinfrastructure and capabilities at the tenant premises may differ fromthe infrastructure and capabilities in the remote operationsenvironment. According to one embodiment, the premise contact center maybe operated by enterprise operations team while the remote operationsenvironment may be operated by an operations team outside of theenterprise.

The remote operations environment 16 is configured to provide a point ofpresence for connection to various telephony service providers.According to one embodiment, media traffic transmitted using a Real-timeTransport Protocol (RTP) terminates in the remote operationsenvironment. The remote operations environment may provide a guaranteedquality of service (QoS) for the media traffic. In another embodiment,no QoS guarantees are provided for the media traffic traversing theremote operations environment 16.

According to one embodiment, the remote operations environment 16includes an edge device 20 configured to control signaling and mediastreams involved in setting up, conducting, and tearing down voiceconversations and other media communications between, for example, acustomer and a contact center agent. According to one embodiment, theedge device 20 is a session border controller controlling the signalingand media exchanged during a media session (also referred to as a“call,” “telephony call,” or “communication session”) between thecustomer and the agent. According to one embodiment, the signalingexchanged during a media session includes SIP, H.323, Media GatewayControl Protocol (MGCP), and/or any other voice-over IP (VoIP) callsignaling protocols conventional in the art. The media exchanged duringa media session includes media streams which carry the call's audio,video, or other data along with information of call statistics andquality.

According to one embodiment, the edge device 20 operates according to astandard SIP back-to-back user agent (B2BUA) configuration. In thisregard, the edge device 20 is inserted in the signaling and media pathsestablished between a calling and called parties in a VoIP call. In thebelow embodiments, it should be understood that other intermediarysoftware and/or hardware devices may be invoked in establishing thesignaling and/or media paths between the calling and called parties.

According to one embodiment, the remote platform 14 is a multi-tenantplatform shared by multiple tenants. The platform includes standardhardware components such as, for example, one or more processors, disks,memories, and the like, used for implementing one or more of the contactcenter components (e.g. media server, recording server, SIP server,etc.). According to one embodiment, the one or more contact centercomponents are implemented as software on the remote platform. Thesoftware components may be hosted by one or more virtual machines. Thevirtual machines may be dedicated to each tenant, or shared among thevarious tenants.

The appliances 10 maintained at each contact center premise 12 includecontact center components which may or may not be included in the remoteoperations environment 16. For example, the appliances may include atelephony/SIP server, routing server, statistics server, agent devices(e.g. telephones, desktops, etc.), and/or other controllers typical forrendering contact center services for the particular contact center.Because the appliances are located locally within the contact centerpremise, the contact center retains control of such appliances.

According to one embodiment, VoIP infrastructure 26 (e.g. SIP trunk) isused to provide connectivity between a public switched telephony network(PSTN) 24 and the private network 22. According to one embodiment, theprivate network 22 implements MPLS (Multi-Protocol Label Switching) fortransmitting VoIP communication over a wide area network (WAN) vialeased lines. Although MPLS is used as an example, a person of skill inthe art should recognize that any other mechanism in addition or in lieuof MPLS may be used for ensuring quality of service guarantees, bitrates, and bandwidth for calls traversing the private network. Due tothe quality of service guarantees provided by the private network 22,consistent call quality and security can generally be expected for thosecalls while traversing the private network.

According to one embodiment, the edge device 20 in the remote operationsenvironment 16 exerts control over the signaling (e.g. SIP messages) andmedia streams (e.g. RTP data) routed to and from customer devices 18 andpremise appliances 10 that traverse the private network 22. In thisregard, the edge device 20 is coupled to trunks 28 that carry signalsand media for calls to and from customer devices 18 over the privatenetwork 22, and to trunks 30 that carry signals and media to and fromthe premise appliances 10 over the private network. The edge device 20is also coupled to the remote platform 14 which provides contact centerservices to the customers.

The remote operations environment 16 may also be coupled to other publicoperations environments (e.g. public cloud computing environments), andsome processing may be distributed to the other remote operationsenvironments as will be apparent to a person of skill in the art. Forexample, processing intelligence and media handling that do not requireQoS may be distributed to the other remote operations environments onbehalf of one or more tenants. For example, the public operationsenvironment may host a virtual machine dedicated to each tenant with aSIP server, routing service, and the like, for handling inbound andoutbound voice contacts.

I. Contact Center Services in Hybrid Environment

FIG. 2 is a schematic block diagram of a system for providing customerself-service in a hybrid operations environment according to oneembodiment of the invention. The customer self-service may be referredas an interactive-voice-response (IVR) self-service. In this regard, theremote platform 14 provides a voice platform 58 for multiple subscribingtenants for providing customer self-service functionality for inboundcalls directed to any of the multiple tenants. Although self-service andassisted-service capabilities are contemplated to be provided by thevoice platform, a person of skill in the art should recognize that othertypes of assisted service, multimedia interactions, and applicationsoutside of the contact center are also possible.

The voice platform 58 may host, for example, a SIP server 56, resourcemanager 50, speech servers 54, and a media control platform 52. Theresource manager 50 and media control platform 52 may collectively bereferred to as a media controller. According to one embodiment, the SIPserver 56 acts as a SIP B2UBA, and controls the flow of SIP requests andresponses between SIP endpoints. Any other controller configured to setup and tear down VoIP communication session may be contemplated inaddition or in lieu to the SIP server as will be apparent to a person ofskill in the art. The SIP server 56 may be a separate logical componentor combined with the resource manager 50. In some embodiments, the SIPserver may be hosted at the contact center premise 12, and/or in theremote operations environment. Although a SIP server is used as anexample in the various embodiments of the present invention, a person ofskill in the art should recognize that any other call server configuredwith any other VoIP protocol may be used in addition or in lieu of SIP,such as, for example, the well-known H.232 protocol, Media GatewayControl Protocol, Skype protocol, and the like.

The resource manager 50 is configured to allocate and monitor a pool ofmedia control platforms for providing load balancing and highavailability for each resource type. According to one embodiment, theresource manager 50 monitors and selects a media control platform 52from a cluster of available platforms. The selection of the mediacontrol platform 52 may be dynamic, for example, based on identificationof a location of a calling customer, type of media services to berendered, a detected quality of a current media service, and the like.

According to one embodiment, the resource manager is configured toprocess requests for media services, and interact with, for example, aconfiguration server having a configuration database, to determine aninteractive voice response (IVR) profile, voice application (e.g. VoiceExtensible Markup Language (Voice XML) application), announcement, andconference application, resource, and service profile that can deliverthe service, such as, for example, a media control platform. Accordingto one embodiment, the resource manager may provide hierarchicalmulti-tenant configurations for service providers, enabling them toapportion a select number of resources for each tenant.

According to one embodiment, the resource manager is configured to actas a SIP proxy, SIP registrar, and/or a SIP notifier. In this regard,the resource manager may act as a proxy for SIP traffic between two SIPcomponents. As a SIP registrar, the resource manager may acceptregistration of various resources via, for example, SIP REGISTERmessages. In this manner, the voice platform 58 may support transparentrelocation of call-processing components. In some embodiments,components such as the media control platform, do not register with theresource manager at startup. The resource manager detects instances ofthe media control platform 52 through configuration informationretrieved from the configuration database. If the media control platformresource group has been configured for monitoring, the resource managermonitors resource health by using, for example, SIP OPTIONS messages.For example, to determine whether the resources in the group are alive,the resource manager periodically sends SIP OPTIONS messages to eachmedia control platform resource in the group. If the resource managerreceives an OK response, the resources are considered alive.

According to one embodiment, the resource manager act as a SIP notifierby accepting, for example, SIP SUBSCRIBE requests from the SIP server 56and maintaining multiple independent subscriptions for the same ordifferent SIP devices. The subscription notices are targeted for thetenants that are managed by the resource manager. In this role, theresource manager periodically generates SIP NOTIFY requests tosubscribers (or tenants) about port usage and the number of availableports. The resource manager supports multi-tenancy by sendingnotifications that contain the tenant name and the current status (in-or out-of-service) of the media control platform that is associated withthe tenant, as well as current capacity for the tenant.

The resource manager is configured to perform various functions:

Resource management—The resource manager allocates and monitors SIPresources to maintain a current status of the resources within a voiceplatform 58 deployment. In this regard, the resource manager providesload balancing and high availability for each resource type, as theworkload is evenly distributed among resources of the same type. Theseprocesses help to ensure that new, incoming services are not interruptedwhen a resource is unavailable.

Session management—The resource manager combines two logical functionsof session management:

Physical resource management—The resource manager monitors the status ofthe various voice platform resources and, based on request-for-serviceand capability mapping, routes to resources that offer a particular setof capabilities or services.

Logical service management—The resource manager applies high-levelapplication and business logic to select the service that is deliveredand the parameters that are applied. In this regard, the resource tofulfill the service does not need to be specified in advance. In thisway, the resource manager provides session management functions tohandle logical call sessions, individual calls within a logical session,and the lifetime and coordination of call legs within a call session.

Service selection—When a call session arrives at the resource manager,the resource manager maps the call to an IVR profile and, if applicable,to a tenant, and selects a service for the request. There are variousways in which the resource manager may determine which IVR profile toexecute. According to one embodiment, a dialed number identificationservice (DNIS) may be used to identify which application to run. In thisscenario, the incoming call corresponds to the DNIS.

According to one embodiment, when a platform administrator segregatesservices into a multi-tiered hierarchy, the resource manager alsoidentifies the tenant for which a request is intended. The IVR profile,policy enforcement, and service parameters may be determined by thetenant that is associated with the request. In a hierarchicalmulti-tenant (HMT) environment, when a tenant is selected, the policiesenforced, and application and service parameters associated with thattenant, may also affect the child tenants within that tenant object.

After the resource manager has determined the IVR profile for a session,it identifies the service type and the service prerequisites for eachcall leg (also referred to as a call path or segment of a callconnection). For each type of service within an IVR profile, one mayconfigure a set of service parameters that the resource manager forwardsto the VoiceXML application to affect the way that the application isexecuted. For example, default languages may be configured for theVoiceXML services for voice applications.

Policy enforcement—According to one embodiment, for each IVR Profileand, if applicable, for each tenant, policies may be configured such as,for example, usage limits, dialing rules, and service capabilities. Theresource manager enforces policies by imposing them on the VoiceXMLapplication to determine whether or not to accept a SIP session. If thesession is accepted, the resource manager locates a resource to handleit. The resource manager may also enforce policies related to how aVoiceXML or CCXML application uses a resource. For multiple tenants, theresource manager may be configured to apply and enforce policies in ahierarchical manner. HMT enables a service provider or parent tenant toallocate portions of its inbound ports to each reseller (or childtenant). The reseller can, in turn allocate ports to a number of childtenants within its tenant object. When tenant policies are enforced atthe child tenant level, the policies are propagated to all other childtenants within that child tenant object.

Service request modification—According to one embodiment, before theresource manager forwards a request to a resource that can handle themapped service, it can modify the SIP request to add, delete, or modifythe SIP parameters. This may be defined on a per-service/per-applicationbasis.

Resource selection—After the resource manager has identified an IVRProfile and service type, it identifies a resource group that canprovide the service. Then, on the basis of the load-balancing scheme forthe group and the status of individual physical resources in the group,it allocates the request to a particular physical resource.

Resource selection with geo-location information—When the resourcemanager receives a request with geo-location information from a gatewayresource (SIP Server), it checks the resource groups to determine if thegeo-location parameter that is configured for the group matches thegeo-location in the request. If it finds a match, the resource managerroutes the call to the group based on port availability, preference andother criteria.

Resource selection for outbound campaigns—For outbound-call campaigns,the resource manager is configured to predict the ratio of agent callsto customer calls. When there are multiple media control platforms in adeployment, the resource manager may distribute calls based on themaximum number of calls and free ports for a particular campaign.

Call-data reporting—When data collection and logging events occur, theresource manager sends these log events to, for example, a reportingserver.

In some embodiments, the voice platform 58 may not include a resourcemanager 50, or the functionality of the resource manager 50 may beincorporated into another voice platform component, such as, forexample, the media control platform 52.

Referring again to FIG. 2, the speech servers 54 are configured withspeech recognition technology to provide automatic speech recognitionand text-to-speech functionality for use in voice applications.

The media control platform 52 is configured to provide call and mediaservices upon request from a service user. Such services, include,without limitation, initiating outbound calls, playing music orproviding other media while a call is placed on hold, call recording,conferencing, call progress detection, playing audio/video promptsduring a customer self-service session, and the like. One or more of theservices are defined by voice applications 60 a, 60 b (e.g. VoiceXMLapplications) that are executed as part of the process of establishing amedia session between the media control platform and the service user.

According to one embodiment, the voice platform 58 is shared by variouscontact centers for which contact center services are provided.According to this embodiment, multiple voice applications for multipletenants run on the same media control platform instance withoutinterfering with one another. Identification of the tenant (e.g. basedon the telephone number dialed by the customer), for which a voiceapplication is run, allows a proper voice application to be selected andexecuted for that call.

In one example where customer self-service is to be provided for aninbound call, the call comes in to the edge device 20 and is forwardedto the SIP server 56. The edge device 20 is configured to identify atenant to which the call is directed, and identify the SIP server 56configured for the tenant (e.g. based on the inbound phone number thatwas dialed). According to one embodiment, the SIP server 56 passes thecall to the resource manager 50 by sending a signaling message (e.g. SIPINVITE message) to the resource manager. According to one embodiment,there is no separate SIP server 56 set up for the tenant, and some ofthe functionalities of the SIP server are instead incorporated into theresource manager 50. According to one embodiment, the resource manageris shared by multiple tenants.

The resource manager is configured to identify the contact centerassociated with the SIP server 56 generating the signaling message (e.g.based on a source address of the SIP server), and further identify avoice or call-control application (referred to as an interactive voiceresponse (IVR) profile), and a service/resource for the request. Theparticular service that is requested may be identified, for example, inthe signaling message to the resource manager.

The resource manager 50 is configured to identify the appropriate mediacontrol platform 52 instance from a cluster of media control platforminstances based on the IVR profile, load balancing considerations, andthe like, and forward a request to the identified media controlplatform. In forwarding the request, the resource manager is configuredto insert additional headers or parameters as specified by the servicerequirements, service parameters, and polices that have been configuredfor the IVR profile.

The media control platform 52 is configured to fetch the voiceapplication 60 a, 60 b from, for example, a web server, via an HTTPrequest. The web server hosting the voice application 60 a, 60 b mayreside in the remote operations environment 16 or contact center premise12.

According to one embodiment, the media control platform 52 includes aninterpreter module for interpreting and executing the voice application.In some embodiments, the media control platform, through the resourcemanager 50, may invoke additional services such as, for example,automatic speech recognition or text-to-speech services, from the speechservers 54.

An RTP media path 62 is established between the media control platform52 and the end user device 18 through the edge device 20, upon theexecuting of the voice application. The resource manager 50 ends thecall when one of the parties (end user device 18 or media controlplatform 52) disconnects (e.g. at the end of self-service), or when thecall is transferred out of the voice platform 58 (e.g. transferred to anagent).

FIG. 3 is a schematic block diagram of a system for providing outboundnotifications in a hybrid operations environment according to oneembodiment of the invention. The system is similar to the system in FIG.2 in that it includes a remote voice platform 58′ which hosts the SIPserver 56, resource manager 50, and media control platform 52. Inaddition, the voice platform 58′ further hosts an outbound gateway 55configured to manage the initiation of outbound sessions. According toone embodiment, an outbound session is controlled by an outboundapplication 100, which in the illustrated embodiment, is depicted toreside in a web server (not shown) at the contact center premise 12. Aperson of skill in the art should recognize, however, that the outboundapplication may also reside in a server hosted by the remote operationsenvironment 16.

According to one embodiment, the outbound application initiates anoutbound call session via an HTTP request to the outbound gateway 55over a data link 102 traversing the private network 22. The requestincludes, in one embodiment, the necessary information for initiatingthe outbound call which may be provided by the outbound application. Forexample, the outbound application may control the timing of the call,the number to be called, and a voice application 108 a, 108 b to beinvoked for the call.

The outbound gateway 55 is coupled to the SIP server 56 which isconfigured to establish call legs from the edge device 20 to the enduser device 18, and from the edge device 20 to the media controlplatform 52, and bridge the two call legs together for establishing amedia path 106 between the end user device 18 and the media controlplatform 52. The voice notification provided to the customer during theoutbound call depends on the voice application 108 a, 108 b identifiedby the outbound application 100. As in the embodiment of FIG. 2, thevoice application may be retrieved from a web server in the contactcenter premise 12 or in the remote operations environment 16.

Upon completion of the outbound notification, the outbound gateway 55 isconfigured to collect results of the call from the media controlplatform 52, and provide such results to the outbound application 100 ina notification message.

FIG. 4 is a schematic block diagram of a system for providing callparking services in a hybrid operations environment according to oneembodiment of the invention. According to this embodiment, a SIP server70 similar to the SIP server of FIG. 2 is hosted at the contact centerpremise 12 instead of the remote operations environment 16. The premisefurther hosts a routing server 72 configured to route an interaction toa contact center resource based on a routing strategy identified by therouting server. The SIP and routing servers 70, 72 being local to thepremise may also be referred to as local controllers. Media services areprovided remotely, however, via the resource manager 50 and medialcontroller 52 in the remote operations environment 16.

In one example, an inbound VoIP call is received by the edge device 20and routed to the SIP server 70. The SIP server 70 queues the calllocally at the contact center premise and transmits a message to therouting server 72 for routing the call to an available contact centerresource (e.g. agent). In the event that no resources are available forhandling the call, the routing server 72 transmits a message to the SIPserver 70 over a local data connection 74 of this fact. In response, theSIP server 70 queues the call locally in an inbound queue, and transmitsto the resource manager 50 over a data link 78 traversing the privatenetwork 22, a request for call parking media services. The resourcemanager 50 identifies the appropriate media control platform 52 tohandle the request, and upon identification of such a platform, a mediachannel/path 80 is established between the end user device 18 and themedia control platform 52 via the edge device 20. Although control ofthe call is retained by the SIP server 70 at the contact center premise,the media channel 80 need not loop through the contact center premise.According to one embodiment, the SIP server 70 retains control of thecall by transmitting signaling messages to various components, includingthe resource manager 50, to control the media paths that are generatedand/or broken down.

As part of the call parking service, the media control platform 52 mayuse the media channel 80 to provide media such as, voice notificationsand/or music, to the customer, for indicating that no agents arecurrently available. The voice notifications and/or music that areselected may depend on the voice application retrieved by the mediacontrol platform. As part of the call parking service, the media controlplatform may also be configured to periodically transmit a message tothe routing server 72 requesting an amount of estimated wait timecalculated by the routing server 72. The request may be transmitted overa data link 76 that traverses the private network 22. In response, therouting server 72 provides the requested information to the mediacontrol platform 52, and is used by the voice application to outputcorresponding audio (e.g. “we estimate your wait time to be between 5and 10 minutes”) via the media channel established between the mediacontrol platform and the end user device 18.

The routing server 72 is configured to monitor for availability of thecontact center resource, and upon identification of such a resource,transmits a message to the SIP server 70. In response to theavailability message, the SIP server 70 is configured to transmit amessage to the resource manager 50, via the data link 78, requestingtermination of the call parking service. In this manner, serviceprovided by the media control platform 52 is revoked by the local SIPserver 70 who retains control of the call while media services are beingprovided from the remote operations environment. The media controllercontrols the media based on the request, and terminates the call parkingservice. Upon exchange of signaling messages between the SIP server 70and the identified contact center resource, such as, for example, anagent device at the contact center premise 12, a call leg is establishedfrom the edge device 20 to the contact center resource to allow exchangeof media between the customer and the contact center resource. Thecontrol signals transmitted by the SIP server 70, therefore, replaces acall leg between the edge device 20 and the media control platform 52 inthe remote operations environment 16, with a new call leg establishedbetween the edge device 20 and the contact center resource at thecontact center premise.

FIG. 5 is a schematic block diagram of a system for providing callprogress detection for outbound calls made in a hybrid operationsenvironment according to one embodiment of the invention. According tothis embodiment, the local contact center premise 12 hosts a SIP server90 and an outbound call server 92 as local appliances 10, while theremote platform 14 in the remote operations environment 16 hosts theresource manager 50 and media control platform 52. The SIP server 90 maybe similar to the SIP server 56 of FIG. 2, and may be configured toreceive commands to initiate an outbound call as directed by theoutbound call server 92. In this regard, the outbound call server 92 maybe configured with an outbound application (not shown) which providescall control during, for example, an outbound campaign. The outboundapplication may be similar to the outbound application 108 a, 108 b, ofFIG. 3. In this regard, the outbound application may control the timesand numbers to call, the voice applications to be invoked, and the like.A difference in the outbound applications is that the outboundapplication in FIG. 3 controls the media control platform to leave amessage if the call is picked up by a person or an automated answeringsystem, while the outbound application in FIG. 5 controls the mediacontrol platform to send a message if the call is picked up by a personfor connecting the call to an agent

According to one embodiment, an outbound call is initiated as instructedby the outbound application executed by the outbound call server 92, ina manner similar to what was discussed with respect to the embodiment ofFIG. 3. According to one embodiment, the media control platform providesthe media for the outbound notification. In addition, the media controlplatform 52 may be configured to provide call progress detection basedon for example, a request for such service from the SIP server 90 asdetermined by the executed outbound application. The request forinitiating the outbound call and for call progress detection may betransmitted via a data link 120 that traverses the private network 22.

In response to the request for call progress detection, the mediacontrol platform 52 monitors the call progress for identifyingtriggering actions, such as, for example, the answering (or not) of theoutbound call, including identifying the type of device or personanswering the call (if at all). The call progress information isforwarded to the outbound call server 92 over a data link 122 as well asto the SIP server over data link 120. In response to the information,the outbound call server 92 may update its records, attempt calls toalternate numbers (in case a call to a first number was unsuccessful),and the like.

According to one embodiment, in response to receiving an update that acustomer (as opposed to an answering machine or fax machine) hasanswered the call, the SIP server 90 may be configured to transmit amessage to the outbound call server, to connect the customer with a liveagent. According to one embodiment, the outbound call server 92 may beconfigured to match an agent camping on a media control platform to theanswering customer that is connected to the same media control platform.Once the agent is identified, the call is connected by establishing acall leg from the edge device 20 to the device of the identified agent.This results in the call leg between the edge device 20 and the mediacontrol platform 52 being replaced with the call leg between the edgedevice 20 and the agent device.

FIG. 6 is a schematic block diagram of a system for call recording in ahybrid operations environment according to one embodiment of theinvention. This embodiment is similar to the embodiments of FIGS. 4 and5 in that the resource manager 50 and media control platform 52 arehosted by the remote platform 14 in the remote operations environment16. In addition to the resource manager and media control platform, theremote platform further hosts a recording server 400 configured torecord media exchanged during a media session. Although the recordingserver 400 is depicted as a separate component, a person of skill in theart should recognize that functionality of the recording server may beincorporated into the media control platform 52.

According to one embodiment, the media control platform 52 is configuredfor active recording. Unlike passive recording where VoIP recording isdone by connecting a passive recording system to a switch to monitor allnetwork traffic and pick out only the VoIP traffic to record, activerecording allows a recording device to be an active participant in thecall for recording purposes. In this regard, the media control platform52 is in the media path established between two communicating parties inorder to actively record (e.g. replicate and store) the media traversingthe media path.

According to one embodiment, the contact center premise hosts a SIPserver 402 which may be similar to the SIP server 70 of FIG. 4, toinitiate a call recording of a call established between the end userdevice 18 and an agent device 404, via the media control platform in theremote operations environment 16. In response to a request for recordingservices, the media control platform 52 performs media bridging 406between the end user device 18 and the agent device 404, and initiates arecording session. The media control platform 52 replicates the media408 a, 408 b to and from the end user device 18 and the agent device404, and streams the replicated media to the recording server 400 whichthen proceeds to store the replicated media in a local and/or remotestorage device (not shown). The local storage device may be, forexample, a disk storage mechanism (e.g. disk array) in the remoteoperations environment 16 that may be scaled for the cluster of mediacontrol platforms in the remote operations environment. The remotestorage device may be hosted, for example, in an environment (e.g. apublic cloud computing environment) separate from the remote operationsenvironment 16. According to one embodiment, the storage devices storemedia recordings for a plurality of tenants, in a safe and securemanner. In this regard, the recordings are stored in the storage devicesin an encrypted manner (e.g. via a public key), which is configured tobe decrypted (e.g. for listening) by the tenant who may own, forexample, a private key.

According to one embodiment, the recording server 400 is configured toreceive metadata of the call recordings from the SIP server 402 over adata link 410. The metadata may be stored in association with thecorresponding call recordings in the same or separate data storagedevice as the actual call recordings. According to one embodiment, themetadata is stored as header data for the call recordings.

Recording can be enabled from routing strategy by sending aRequestRouteCall message from the SIP server 402 to the media controlplatform 52 with extension key “record” and value set to “source” torecord all legs until customer leaves the call, or “destination” torecord while the target agent is on the call. Choosing recording using arouting strategy is referred to as selective recording. According to oneembodiment, in recording based on a routing strategy, a tenant'srecording parameters are checked for identifying a percentage of callsto be recorded and requesting recording for a particular call based onthe identified percentage.

According to one embodiment, the SIP server 402 may be configured torecord calls for specific agent DNs, or for all incoming calls.According to one embodiment, a “norecord” extension key may be supportedfor the RequestRouteCall message. When a “norecord” key is set, norecording is performed even if the call is set to record at the DNlevel. Dynamic recording control may still be allowed, however, afterthe call is established, so as to allow the agent to being recording thecall when desired.

According to one embodiment, the agent device 404 may provide agraphical user interface with dynamic recording controls for allowingthe agent to start, pause, resume, and stop a recording. According toone embodiment, commands for controlling the recording are forwarded bythe SIP server 402. Other clients other than the agent device 404 mayprovide the recording commands even if not party to the call.

FIG. 7 is a signaling flow diagram for recording a call in a hybridoperations environment according to one embodiment of the invention. Theflow begins with step 420 where a media session is established betweentwo communication devices referred to as party A 440 and party B 442.

In steps collectively identified as steps 422 and 424, a pre-negotiationphase ensues between the SIP server, resource manager, and media controlplatform 52, for providing a copy of the established media sessionbetween party A 440 and party B 442, to the media control platform 52.According to one embodiment, the information on the media session withparty A is provided to the media control platform 52 in step 422 via theresource manager 50 via a session description protocol (SDP) thatincludes information such as, for example, IP address, port number, andcodec used for sending and receiving RTP streams with party A.Information on the media session with party B is similarly provided tothe same media control platform in step 424.

In steps collectively referred to as step 426, the SIP server 402transmits a request to the media control platform 52 to record the call.In this regard, during signaling which is collectively referred to asstep 428, the SIP server 402 transmits an INVITE message to the mediacontrol platform 52 (via the resource manager 50), for establishing amedia path with party A 440, in which case the media control platformgenerates a session based on the session information received in thepre-negotiation phase in step 422 for party A. A media path for thegenerated media session is then established via signaling between theSIP server 402 and party A 440, as shown collectively as step 430.

Similarly during signaling which is collectively referred to generallyas step 432, the SIP server 402 transmits an INVITE message to the mediacontrol platform 52 (via the resource manager 50), for establishing amedia path with party B 442. The media control platform generates, inresponse, a session based on the session information received in thepre-notation phase in step 424 for party B. A media path for thegenerated media session is then established via signaling between theSIP server 402 and party B 442, as shown collectively as step 434.

Media is then exchanged via established media paths 436 and 438. In thismanner, the media control platform 52 bridges media between party A 440and party 460, and records the exchanged media in step 439.

II. Handling Connection Failures in Hybrid Environment

FIG. 8 is a schematic block diagram of a hybrid operations environmentwith failover capabilities according to one embodiment of the invention.An inbound call from the customer end device 18 is forwarded to the SIPserver 56 for routing to a contact center agent. In the illustratedembodiment, the contact center agent registers with the SIP server 56 adirectory number associated with an agent telephone 200. The agent alsohas access to a desktop 202 which may be used for receiving data aboutthe inbound call from the SIP server 56. According to one embodiment,the data is transmitted over a data link 204 over a wide area networkwhich may not utilize the same connections used for the private network22. The desktop 202 may also provide a graphical user interface withcall control options, such as, for example, options for answering calls,putting calls on hold, transferring calls, and the like.

According to one embodiment, the SIP server 56 is configured to monitoron a regular or irregular basis, the status of a connection to the agentdevice 200. In this regard, the SIP server 56 may be configured totransmit polling/heartbeat messages to the agent device 200 over a datalink 206 traversing the private network 22, and wait for anacknowledgement within a preset amount of time. If the SIP server doesnot receive the acknowledgement within the set time period, the SIPserver may be configured to assume that data link 206 or agent device200 is faulty. In this case, the SIP server is configured to retrieve alist of alternate numbers (e.g. direct inward dialing (DID) numbers) toalternate phones 208 maintained by the SIP server for the agent.According to one embodiment, the alternate number is a number that isnot used by any agent for registering with the SIP server.

In response to identifying the alternate number, calls to be routed tothe agent are sent to the alternate phone number instead of thedirectory number in a seamless manner. According to one embodiment, calldata continues to be delivered to the agent desktop 202 over the datalink 204 which is not affected by the faulty data link 206 traversingthe private network 22. According to one embodiment, the agent mayengage in call control via the agent desktop for controlling callsrouted to the alternate number. Routing to the directory number for theagent resumes when connection to the agent device 200 over the data link206 is functional again.

According to one embodiment, a media path 205 a, 205 b from the end userdevice 18 to the alternate phone 208 is bridged through the mediacontrol platform 52, as shown in FIG. 8, if the call between thecustomer and the agent is to be recorded. Otherwise, the media path isbridged through the edge device 20 without traversing through the mediacontrol platform.

FIG. 9 is a schematic block diagram of a hybrid operations environmentwith failover capabilities according to one embodiment of the invention.In the illustrated embodiment, the SIP server 56 is deployed in anactive/hot-standby pair. For example, the remote SIP server 56 in theremote operations environment 16 may be deployed as a primary instance,while a local SIP server 250 in the contact center premise is deployedas a standby (failover) instance. Although a SIP server is used as anexample for which failover capabilities are provided, a person of skillin the art should recognize that other contact center components mayhave similar failover capabilities.

According to one embodiment, an agent registers with the local SIPserver 250 his or her registration information including, for example, adirectory number associated with an agent device 252. The local SIPserver 250 deployed as the hot-standby instance proxies the registrationto the remote SIP server 56 deployed as the primary instance. In thisregard, a copy of the agent registration information is forwarded to theremote SIP server 56 over a data link 254 for storing therein.

In the illustrated embodiment, an inbound call arrives at a mediagateway which attempts to transmit, over a data link 258 traversing theprivate network 22, a request to route the call to the remote SIP server56. If the request is successfully received by the remote SIP server 56,and assuming that the call is to be routed to the agent device 252, theSIP server signals the media gateway 256 to route the call to the agentdevice based on the registration information stored at the remote SIPserver 56. A media channel 260 is then established to the agent device252 for communicating with the end user device 18.

In the event, however, that the remote SIP server 56 does not respondwithin a preset amount of time to the request to route from the mediagateway 256, the local SIP server 250 takes over, and the media gatewayproceeds to send the request to the local SIP server over a local datalink 262.

FIG. 10 is a schematic layout diagram of distribution of various mediaservices in a hybrid operations environment according to one embodimentof the invention. The media services include but are not limited to callprogress detection 300, conference 302, music-on-hold 304, call parking306, call recording 308, and IVR self-service. Services such asconference 302 and music-on-hold 304 may be provided by one or moremedia controllers 310 at the contact center premise 12 by storing in theSIP server as the contact parameter 314, 316 for these services, theaddress of the resource manager at the contact center premise 12. Otherservices such as call progress detection 300, call recording 308, andIVR self-service 323 may be provided by one or more media controllers312 at the remote operations environment by storing in the SIP server asthe contact parameter 320, 321, 324 for these services, the address ofthe resource manager at the remote operations environment 16.

Other services, such as, for example, call parking 306 may be configuredto be provided by media controllers 310, 312 at the contact centerpremise 12 as well as in the remote operations environment 16, in orderto provide overflow support. The media controller that is to be invokedfirst is determined by a priority level stored by the SIP server in thecontact parameter 318, 322 set for the service. In the illustratedexample, the priority level set for the media controller 310 at thecontact center premise (e.g. priority=0) signifies a higher prioritythan a priority level set for the media controller 312 in the remoteoperations environment (e.g. priority=1).

The SIP server transmits a request for media service to the mediacontroller 310 at the higher priority. If the media controller 310 hasreached a maximum threshold configured for the media controller, the SIPserver receives a SIP response from the resource manager indicating thisfact. The SIP server then sends the request to the overflow mediacontroller 312 at the lower priority. The overflow media controller 312continues to provide media services in response to requests from the SIPserver until the load in the primary media controller 310 falls below adesired threshold.

FIG. 21 is a schematic block diagram of a hybrid operations environmentfor providing media services according to one embodiment of theinvention. In the illustrated embodiment, a SIP server 2100 is deployedat the contact center premise 12. The SIP server may be similar, forexample, to the SIP server 56 of FIG. 3.

According to one embodiment, a call to the particular contact center isreceived by the edge device 20, and the edge device signals the SIPserver 2100 to route the call. In response, the SIP server 2100determines a media service appropriate for servicing a portion of thecall, and identifies a media resource based on the type of mediaservice. For example, if the media service is IVR self-service, the SIPserver may identify the resource manager 50 in the remote operationsenvironment 52 based on the contact parameter stored in the SIP serverfor this particular service. In response to the identification, the SIPserver transmits a request to the resource manager 50 for connecting thecall to the remote control platform 52 which provides voice promptsduring the IVR self-service. Thus, for this portion of the call, a callleg 2104 a is established from the customer end user device to the edgedevice, and another call leg 2104 b from the edge device 20 to theremote media control platform 52.

During the call, the SIP server 2100 decides that another media serviceis to be provided for a different segment of the call. For example, themedia service may be playing music while the call is placed on hold(e.g. music-on-hold service). The request for this media service may be,for example, from a routing server (not shown) based on a routingstrategy executed by the routing server.

In response determining that a second media service is to be provided,the SIP server identifies the location of the media resource (e.g. alocal resource manager) to provide the requested service. In theillustrated example, a media control platform 2102 at the local premiseis invoked to provide media for the second portion of the call. For amusic-on-hold service, the media that is provided by the media controlplatform 2012 is music configured by the tenant for this service. Inthis regard, the call leg 2104 b from the edge device 20 to the remotemedia control platform 52 is replaced with a newly established call leg2104 c from the edge device 20 to the media control platform 2102. Inthis manner, media is moved from the remote operations environment 16 tothe local operations environment 12 via control signals transmitted bythe SIP server 2100 at the local operations environment.

FIG. 22 is a schematic block diagram of a hybrid operations environmentfor providing media services according to another embodiment of theinvention. According to this embodiment, a SIP server 2202 is at thecontact center premise 12 and media is provided via the resource manager52 and media control platform 50 in the remote operations environment16. The SIP server 2202 may be similar to the SIP server 56 of FIG. 2.In the illustrated embodiment, a call from the end user device 18arrives at a media gateway 2200 at the contact center premise 12 and theSIP server 2202 is invoked for routing the call. When media service isto be provided for the call, the SIP server identifies the resourcemanager 52 in the remote operations environment 16 (e.g. via thedirectory number configured at the SIP server for the particular mediaservice), and transmits a signaling message for providing the mediaservice to the resource manager 52. The resource manager forwards therequest to the media control platform 50 selected to handle the service,and a media path 2204 is established between the end user device and themedia control platform 50. Of course, intermediary software and/orhardware infrastructure may be invoked in establishing the media path.

FIG. 23 is a schematic block diagram of a hybrid operations environmentfor providing media services according to another embodiment of theinvention. According to this embodiment, a SIP server 2300 is in theremote operations environment 16 for controlling media while mediaitself is provided via a resource manager 2302 and media controlplatform 2304 at the local operations environment 12. The SIP server2300 may be similar to the SIP server 56 of FIG. 2. An inbound call isreceived at the edge device 20, and a request to route the call istransmitted to the SIP server 2300. If media is to be provided for thecall, the SIP server identifiers a resource manager, which, in theexample of FIG. 23, is the resource manager 2302 at the contact centerpremise 12. The resource manager in turn identifies the appropriatemedia control platform, which, in the example of FIG. 23, is the mediacontrol platform 2304 at the contact center premise 12. A media path2306 is established between the end user device 18 and the media controlplatform 2304.

III. Geo-Location Based Call Recording

Embodiments of the present invention are also directed to recording in acontact center that provides geo-location support. Geo-location supportallows a contact center with muti-site deployment of particularcomponents to select one of the multi-sites for invoking the componentsin the selected site. This helps minimize WAN traffic or minimizelatency in certain situations.

FIG. 11A is a schematic block diagram of a contact center systemillustrating cost and latency for an typical VoIP call without callrecording according to one embodiment of the invention. In theillustrated embodiment, a customer utilizes a media gateway 500 in aparticular geographic location 502 (e.g. Dallas, Tex.), to transmit aVoIP call to a contact center located in another geographic location 504(e.g. San Francisco, Calif.). One or more appliances 506-512 hosted atthe contact center premise 504 may be invoked for routing the call. Forexample, a SIP server 506 may determine that the call should be routedto an agent device 514 located in a second geographic location 516remote to both the first geographic location and the second geographiclocation. A media channel 518 that traverses the wide area network, suchas, for example, the public Internet, is established between the mediagateway 500 and the agent device 514. Voice data is transmitted via themedia channel. The latency and traffic created in transmitting the voicedata is the latency and traffic associated with traversing the wide areanetwork once, for each voice packet transmitted between the customer andthe agent.

FIG. 11B is a schematic block diagram of the contact center system ofFIG. 11A handling call recording according to existing solutions. In theillustrated prior art system, both the media control platform 510 andthe recording server 512 are deployed at the contact center premise 504.Thus, in response to the SIP server 510 transmitting a command to themedia control platform 510 to record the call between the customer andthe agent, an established media path 520 a, 520 b is bridged through themedia control platform 510 at the contact center premise 504, and mediatransported over the media path is recorded by the recording server 512also at the contact center premise. This solution, however, doubles thetraffic over the wide area network given that the traffic firsttraverses to the media control platform 510 before reaching itsdestination. The solution also adds to the latency of the media path.Such latency, however, may not be acceptable for real-time calls.

According to one embodiment, a contact center is enabled forgeo-location-based call recording which helps minimize latency and costassociated with traditional call recordings.

FIG. 11C is a schematic block diagram of a contact center systemconfigured for geo-location-based call recording according to oneembodiment of the invention. As in the example of FIG. 11B, a customerutilizes a media gateway 530 in a particular geographic location 532(e.g. Dallas, Tex.), to transmit a VoIP call to a contact center locatedin another geographic location 534 (e.g. San Francisco, Calif.).According to one embodiment, the contact center premise hosts appliancessuch as, for example, a SIP server 536, resource manager 538, andrecording server 540. In other embodiments, one or more of theappliances may be hosted in a remote operations environment such as, forexample, the remote operations environment 16 of FIG. 6.

According to one embodiment, one or more media control platforms 542associated with the contact center are distributed to differentgeographic regions, such as for example, the geographic location 532 ator near the media gateway 530. According to one embodiment, a pool ofmedia control platforms 542 is deployed in each geographic region. Forexample, a pool of media control platforms associated with a particularcontact center may be deployed somewhere in North America, another poolof media control platforms may be deployed somewhere in Europe, and yetanother pool of media control platforms may be deployed somewhere inAsia. The exact locations may depend on various factors, such as, forexample, the location of the contact center premise, amount of businessconducted in certain geographic regions, locations of agents, and thelike.

According to one embodiment, other contact center components such as therecording server 540, SIP server 536, and resource manager 538 are notdistributed to the various geographic locations. This helps minimizecost for the contact center without compromising quality of real timecalls between a customer and an agent. In other embodiments, one or moreof the other contact center components are deployed to the variousgeographic regions.

In the example of FIG. 11C, a customer utilizes the media gateway 530 toinitiate a call to the contact center. A SIP server 536 at the contactcenter premise 534 routes the call to an agent device 544 in ageographic location 546 remote from both the geographic location 532 ofthe media gateway 530 and the contact center premise 534. The SIP server536 further determines that the call should be recorded based on, forexample, a DN of the agent handling the call, an express request fromthe agent, or other configuration parameters accessed by the SIP serverfor the contact center. The SIP server 536 selects a geographic regionbased on one or more configuration parameters, and forwards the selectedgeographic region (e.g. geo-location=dallas) to the resource manager 538along with a request to record the call. The resource manager in turnruns a routine for selecting a media control pool tagged to theidentified geographic region. An appropriate media control platform isselected from the pool based on load balancing and other considerations,and a message for recording the call is transmitted to the selectedmedia control platform. An established media path 548 a, 548 b isbridged through the selected media control platform 542. Assuming thatthe media control platform 542 is local to the media gateway 530, themedia path 548 a between the media gateway and the media controlplatform 542 traverses a local network. Network latency is assumed to benegligible when media is sent over the local network.

The media path 548 a between the media control platform 542 and theagent device 544 traverses a wide area network. The latency associatedwith the media path 548 b is the latency associated with traversing thewide area network once. Thus, overall latency in the recorded mediacommunication between the customer and the agent is minimized whencompared to the prior art solution described with respect to FIG. 11B.

According to one embodiment, the replicated media is transmitted forrecording to the recording server 540 over the wide area network via amedia path 550. Any delay encountered in transmitting the media due totraffic on the wide area network may be acceptable due to the fact thatthe replicated media is generally not required to be available in realtime. In other embodiments, the recording server 540 is deployed in thesame geographic location as the media control platform 542. According tothose embodiments, the replicated media traverses a local networkinstead of the wide area network.

According to one embodiment, configuration of geo-location may happen,for example, in two places: DN objects in a switch, and resource groupsfor the media control platform and recording servers. A geo-location tagfor each DN (of type trunk DN, route point DN, extension DN, and trunkgroup DN) is assigned for the media control platform and recordingserver resource groups. A graphical user interface available to acontact center administrator may be used for the assignment of thegeo-location tags.

How a geo-location is selected for each call depends on how the SIPserver 536 is configured. According to one embodiment, the SIP Serverselects a geo-location with the following order or preference forinbound calls:

-   -   1) Geo-location configured in the extension of a request to        route a call (RequestRouteCall) (e.g. an agent's telephone        extension number);    -   2) Geo-location configured in the routing point DN (e.g. a DN        for a contact center component which may further route a call);    -   3) Geo-location configured in the inbound trunk DN (e.g. DN of a        trunk transporting an inbound call); and    -   4) Geo-location configured in the DN where the recording is        enabled.

Of course, other orders are also contemplated. For outbound calls, thefollowing order of preference may be used, although other orders arealso contemplated:

-   -   1) Geo-location configured in the extensions of        RequestRouteCall;    -   2) Geo-location configured in the routing point DN;    -   3) Geo-location configured in the agent DN; and    -   4) Geo-location configured in the outbound trunk DN if recording        is enabled

According to one embodiment, when a DN is configured to be recorded, thegeo-location set at the DN is selected. When more than one DN involvedin the call has a geo-location set (e.g. both inbound Trunk DN and theRouting Point DN have geo-location set), then the SIP server 536 may beconfigured to select the geo-location based on a configured order ofpreference, such as, for example, the preference described above.

The selection of the geo-location may also vary based on the routingstrategy invoked by the SIP server 502 for routing a particular call.For example, if a parameter “record=source” is set in the extensionidentified in a request to route the call, then the geo-location of theinbound Trunk DN of the call is selected if configured. If a parameter“record=destination” is set in the extension of the request to route thecall, then the geo-location of the agent (extension DN) is selected.Selection of the geo-location may also depend on instructions providedby a party specifically requesting dynamic recording.

IV. Handling Call Recording Failures

FIG. 12A is a schematic block diagram of a system for contact centercall recording and recording posting according to one embodiment of theinvention. The system includes a remote operations environment 600 withan edge device 604 for routing calls between customers that utilizevarious telephony service providers 606, and contact center resources ina contact center premise 602. The edge device 604, remote operationsenvironment 600, and contact center premise 602 may be similar torespectively the edge device 20, remote operations environment 16, andcontact center premise 12 of FIG. 6.

In the embodiment illustrated in FIG. 12A, the remote operationsenvironment 600 hosts a resource manager 610, media control platform608, and recording server 616 (which may be incorporated into the mediacontrol platform 608), which may be similar to respectively the resourcemanager 50, media control platform 52, and recording server 400 of FIG.6.

The contact center premise 602 hosts a SIP server 612 in communicationwith the resource manager 610 over a wide area network for signaling themedia control platform 608 to record media transmitted between an agentdevice 620 and a customer (via a telephony service provider 606). Inthis regard, a media path 622 a, 662 b is bridged by the media controlplatform 608, and media transmitted over the media path 622 a, 622 b isreplicated and transmitted to the recording server 616 via messagessimilar to the messages described with respect to FIG. 7.

The system of FIG. 12A further includes a mass storage device 624configured to store recordings transmitted by the recording server 616.The mass storage device may be, for example, an online storage in apublic cloud computing environment offered, for example, by Amazon WebServices (e.g. Amazon S3 online storage web service). The mass storagedevice 624 may also be a local storage device at the contact centerpremise 602.

According to one embodiment, the recording is encrypted by the mediacontrol platform 608 prior to posting into a bucket associated with thetenant for which recordings are being stored. The encryption of theaudio recording may be via an encryption key stored in the IVR profileof the tenant. An authorization key for posting in the mass storagedevice may also be obtained, as necessary, from the tenant's IVRprofile.

According to one embodiment, the remote control environment 600 furtherhosts a web server 614 providing a call recording API for interfacingwith the media control platform 608 and a graphical user interface 628.According to one embodiment, the media control platform 608 uses the APIto post call metadata for a recorded call, including a universalresource identifier (URI) or any other link to the recording stored inthe mass storage device 624. The graphical user interface 628 accessesthe API for accessing call recordings stored in the mass storage device624, and for performing searching and other analytics on the recordings.

According to one embodiment, a key management server 629 is deployed bya tenant for performing key management for the tenant for encryption anddecryption of call recordings. In this regard, the key management server629 provides a user interface for access by tenant administrators 627for uploading and managing certificates for the encryption anddecryption of the call recordings. The key management server 629 may bedeployed in the remote operations environment 600 (or another remoteenvironment) or at the contact center premise 602. In one embodiment,the graphical user interface 628 for accessing the call recordings isintegrated into the key management server 629.

The contact center premise 602 may host a server providing aninteraction concentrator (ICON) application 630 coupled to an ICONdatabase 632. According to one embodiment, the ICON application receivescall and other interaction event details from the SIP server 612 andstores the details in the ICON database 632. The web server 614 isconfigured to access the ICON database 632 over a wide area network andretrieve event details associated with the call metadata received fromthe media control platform 616, and store the event details andassociated call metadata in a call record maintained in a call database634.

FIG. 12B is a schematic block diagram of a system for contact centercall recording and recording posting according to another embodiment ofthe invention. The like element numbers are intended to indicate likeelements or features. The key management server according to theembodiment of FIG. 12B is referred to as a recording crypto server 629′.In this illustrated embodiment, the user interface 628 of FIG. 12A issplit into a playback user interface 628′ and recording user interface628″. The playback user interface 628′ provides prompts and othermechanisms for allowing a user to search, playback, and perform otheractions (e.g. searches for key words or phrases) relating to recordedcalls. The recording user interface 628″ provides prompts and othermechanisms for an administrator to manage cryptographic keys maintainedby the recording crypto server 629′.

According to the embodiment illustrated in FIG. 12B, a speech server 631provides the playback user interface 628′ to access and invoke variousfunctionalities of the speech server. The speech server 631 may besimilar to speech server 54 of FIG. 2, and may provide various speechanalytics and text processing functionalities as will be understood by aperson of skill in the art.

According to the embodiment illustrated in FIG. 12B, part of theprocessing by the web server 614 is called out and handled by a separaterecording processor 615. Specifically, it is the recording processor 615which executes instructions to access the ICON database 632, to retrieveevent details associated with the call metadata received from the mediacontrol platform 616, and to forward the event details and associatedcall metadata to the web server 614 for storing in a call recordmaintained in the call database 634. According to one embodiment, therecording processor 615 may be process or thread running in the same orseparate processor or computing device as the web server 614.

FIG. 12D is a more detailed block diagram of the recording userinterface 628″ according to one embodiment of the invention. Accordingto one embodiment, the recording user interface includes a front-end UIas well as a container for one or more plug-ins 691 for providingvarious recording related functionalities. A tenant administrator 627may, via an end user device, access the recording user interface 628″for accessing one or more of such functionalities. For example, theadministrator may access the interface to provide policies for callrecording maintenance by the web server 614′. The administrator may alsoaccess the interface for providing cryptographic keys and general keymanagement by the recording crypto server 629′. The administrator mayalso access the interface for setting call recording policies in aconfiguration server 690. According to one embodiment, a separate policyis maintained for each tenant, and may include policies such asrecording retention policies, policies related to where call recordingsare to be stored, policies relating to how a file name is to begenerated, and the like.

FIG. 13 is signaling flow diagram for posting a recorded call accordingto one embodiment of the invention. The media control platform 608detects in step 654 that a recording for media exchanged between party A650 and party B 652 has terminated. This may be based, on for example,one of the parties dropping off the call, an end-recording command fromone of the parties, or the like.

In step 656, the media control platform 608 encrypts and stores the callrecording in the mass storage device 624, and receives, in step 658,from a processor coupled to the mass storage device, a URI to therecording.

In step 660, the media control platform 608 posts to the web server 614call metadata including, for example, the received URI.

In step 662, the web server 614 or recording processor 615 performs aquery of the ICON database 632 for pulling additional call events fromthe database in step 664. In step 666, the web server stores the callmetadata and events in the call database 634. The web server may alsocache and batch-update the call records at a later time.

In step 668, the web server 614 informs the media control platform 608of results of the posting of the call recording.

When the media is bridged through the media control platform 608, theplatform becomes a single point of failure for the duration of thecommunication session. According to one embodiment, if the resourcemanager 610 detects failure of a particular media control platform 608,the resource manager notifies the SIP server 612 for prompting the SIPserver to take alternative action on the call.

FIGS. 14A-14B are signaling flow diagrams for handling failure of amedia control platform during a recording according to one embodiment ofthe invention. In step 700, the SIP server 612 provides a message to theresource manager 610 for prompting the resource manager to subscribe themedia control platform 608 with the SIP server. In response, theresource manager 610 transmits, in step 702, information on the mediacontrol platform 608 and other media control platforms it manages andwhich have been assigned to calls. Each media control platform mayhandle, for example, hundreds of calls at a time. The SIP server 612maintains this information in memory until the associated callsterminate.

While a recording for a particular call between party A 650 and party B652 bridged by the media controller 608 is in progress 704, the mediacontroller fails as depicted via step 706. The failure is detected bythe resource manager 610 via, for example, periodic heartbeat messagesbroadcast by the resource manager to all active media control platforms.

In step 708, the resource manager 610 transmits notification on thespecific media control platform 608 that has failed.

The failure of the media control platform 608 results in a break of themedia path between party A 650 and party B. Accordingly, in steps whichare referenced generally via reference 710, the SIP server 612 quicklyre-establishes the disconnected media path via standard SIP messages sothat the communication session continues. A media channel is establishedin step 712, and party A 650 continues to communicate with party B 652.

In steps 720-736, the SIP server 612 attempts to record the call againby initiating a new recording session with the same parameters. Thesteps taken by the SIP server 612 in establishing the new recordingsession for a particular call is similar to the steps discussed withreference to FIG. 7. In this regard, the SIP server 612 identifies,based on the subscription information, that the call between party A andparty B was assigned to the failing media control platform 608, andengages in pre-negotiation with the resource manager 610 for providing acopy of the established media session between party A 650 and party B652, to a second media control platform 800 selected by the resourcemanager 610. According to one embodiment, the information on the mediasession with party A is provided to the media control platform 800 instep 720 via the resource manager 610 via a session description protocol(SDP) that includes information such as, for example, IP address, portnumber, and codec used for sending and receiving RTP streams with partyA. Information on the media session with party B is similarly providedto the same media control platform in step 722.

In steps collectively referred to as step 724, the SIP server 612transmits a request to the second media control platform 800 to recordthe call. In this regard, during signaling which is collectivelyreferred to as step 726, the SIP server 612 transmits an INVITE messageto the media control platform 800 (via the resource manager 610), forestablishing a media path with party A 650, in which case the mediacontrol platform generates a session based on the session informationreceived in the pre-negotiation phase in step 720 for party A. A mediapath for the generated media session is then established via signalingbetween the SIP server 612 and party A 650, as shown collectively asstep 728.

Similarly during signaling which is collectively referred to generallyas step 730, the SIP server 612 transmits an INVITE message to thesecond media control platform 800 (via the resource manager 610), forestablishing a media path with party B 652. The media control platformgenerates, in response, a session based on the session informationreceived in the pre-notation phase in step 722 for party B. A media pathfor the generated media session is then established via signalingbetween the SIP server 612 and party B 652, as shown collectively asstep 732.

Media is then exchanged via established media paths 734 and 736. In thismanner, the second media control platform 800 bridges media betweenparty A 650 and party 652, and replicates the media for recording.

The following step 738 for posting the recorded media in the massstorage device 624 is similar to the steps discussed above with respectto FIG. 13.

In addition to re-recording the call upon the failure, the failed mediacontrol platform 608 instance also provides a mechanism to recover therecording up to the point of the failure. According to one embodiment,the media control platform 608 stores the call recording on a local diskas the recording is going on, which allows the recording to be submittedlater for storing in the mass storage device 624 when the media controlplatform 608 restarts.

FIG. 15 is a conceptual layout diagram of process for recovering arecording upon failure and recovery of a media control platformaccording to one embodiment of the invention. Prior to failure of themedia control platform 608, recording starts at time 900 and the mediacontrol platform writes the call metadata 902 to the local disk.According to one embodiment, once the metadata 902 is written to thedisk in the beginning of the recording, the media control platform 608does not modify the metadata file. Thus, according to one embodiment,runtime information such as timestamps of pause and resume periods arenot stored; however, audio masking is done in the audio file so there isno risk not masking the audio of sensitive and/or confidentialinformation. According to one embodiment, the metadata 902 is written tothe disk unencrypted since metadata does not contain sensitiveinformation.

In embodiments where the audio recording is to be stored in an encryptedform (based on configuration settings for a particular tenant), themedia control platform 608 begins to store encrypted audio recordingbeginning from time 900. As recording proceeds in time, the encryptedaudio recording is temporarily stored in the disk in blocks 908 a, 908b. According to one embodiment, the media control platform 608 uses anencryption algorithm based on, for example, the Advanced EncryptionStandard (AES), which allows block cipher so that encrypted audio may bewritten in blocks.

According to one embodiment, the media control platform 608 randomlygenerates a symmetric session key and uses the generated key to encryptthe audio. The session key is further encrypted using, for example, apublic key provisioned for the tenant, and the encrypted session key 904is also written to the disk at time 900 when the recording begins.According to one embodiment, the encryption of the session key isaccording to any one of various public key cryptography mechanisms knownin the art, such as, for example, public-key cryptography. The mediacontrol platform 608 does not have access to the symmetric key todecrypt the audio file as the key is protected by public key encryption.

According to one embodiment, audio header data 906 is also stored in thedisk when recording begins at time 900.

At time 902, the media control platform 608 fails. The recordingmetadata 902, encrypted session key 904, audio header 906, and encryptedaudio blocks 908 a, 908 b, however, remain on the disk. Assuming thatconversation continues during time 916, the conversation is recorded bythe second media control platform 800 as discussed with respect to FIGS.14A-14B.

At a later time 904, the media control platform 608 restarts. Accordingto one embodiment, upon restarting, the media control platform 608checks the local disk and detects recordings in the disk. The mediacontrol platform 608 packages the audio blocks 908 a, 908 b into apartial encrypted audio file 910, and posts the file to the mass storagedevice 624. In addition, the media control platform 608 submits therecording metadata and the encrypted session key to the call recordingAPI of the web server 614. Once the media control platform 608 submitsthe encrypted audio and full metadata to the call recording API of theweb server 614, the temporary files are removed from disk.

According to one embodiment, the audio in the partial audio file isassociated with a timestamp so that, upon playback, the partial audiofile is played in the correct order relative to other audio recordingsthat may have been written to the mass storage device by other mediacontrol platforms (e.g. the second media platform 800 that takes overafter failure of the media control platform 608), for the same call. Inthis regard, all audio files associated with a particular call areretrieved upon a command for playback, and the various call recordingsegments (each stored in a separate audio file) are stitched together sothat the audio stored in each call recording segment is played in thecorrect order. The fact that the recording of a call may be segmentedinto different audio files, however, is made transparent to a user whomay be searching and selecting an audio recording to be played. That is,the playback graphical user interface 628′ displays a single entry for acall recording which may be identified, for example, by a time, date,and identification of the members in the conversation.

FIG. 16 is a diagram of a structure of call recording metadata providedto the web server 614 according to one embodiment of the invention. Themetadata includes a “recordings” array 1000 which stores metadata for arecording segment for a call. From the perspective of the media controlplatform 608, it posts a single recording segment. Thus, according toone embodiment, the array size for the “recordings array” is one. The“recordings” array includes the following parameters:

-   -   “uri” 1002 contains the URI of the posted call recording in the        mass storage device 624.    -   “start” 1004 is a start time of the call recording.    -   “end” 1006 is an end time of the call recording.    -   “duration” 1008 is a duration of the call recording in seconds.    -   “parameters” 110 include certain parameters applied in the call        recording. This may include, for example, IVR profile service        parameters.    -   “metadata” 1012 are metadata parameters passed by the SIP server        612 to contain relevant metadata about the call recording, such        as, for example, the number that was called 1014, number for the        caller 1016, directory number of the agent who handled the call,        and the date and time the call was handled 1020.    -   “masks” 1022 is an array of timestamps and type information that        represent periods of pause and resume requests received for the        recording segment.    -   “pkcs7” 1024 is a parameter for storing the encrypted session        key provided by the media control platform 608 if the call is        encrypted. The symmetric session key(s) are encrypted via a        public key and stored as a base64 string.

A “metadata” parameter 1026 is also provided with a single propertyreferred to as a “uuid” for storing a unique identifier for the call.

V. Call Recording Encryption

As discussed above, certain tenants (e.g. contact centers providingbanking services) may want call recordings to be encrypted. As discussedabove, one or more session key(s) may be used to encrypt the audiorecordings for a tenant. The session keys may be protected via anypublic key cryptography mechanism known in the art. According to oneembodiment, a public-key cryptography system (PKCS), e.g. PKCS#7, isutilized. Other types of public key infrastructure (PKI) may also beused, such as for example PGP (pretty good privacy) mechanism. Thevarious keys described herein are generally referred to as cryptographickeys.

According to one embodiment, the recording crypto server 629′ deployedby a tenant provides manages public key certificates for the tenant forbinding a public key with the tenant. Multiple certificates may bemaintained for each tenant.

FIG. 12C is a conceptual layout diagram of various componentsinteracting with the recording crypto server 629′ for allowingencryption and decryption of call recordings according to one embodimentof the invention. A tenant administrator 670 accesses the recording userinterface 628″ (FIG. 12B) for providing various pieces of informationwhen uploading a public encryption key for each certificate to therecording crypto server 629′. According to one embodiment, the tenantadministrator provides a private key 674 (also referred to as adecryption key) used to decrypt the session key, a customer-assignedpassphrase 676 used to protect the private key, and a public key 678used to encrypt the session key. Although embodiments of the presentinvention contemplate a passphrase, a person of skill in the art shouldrecognize that the passphrase may be replaced with a password, passcode,or any other information that authenticates a user. Also, one or morepublic keys may be used for encrypting the session key.

The recording crypto server 629′ stores the public key(s) 678 in the IVRprofile 680 for the tenant. The IVR profile 680 may be maintained, forexample, by the configuration server 690 (FIG. 12D). The public key isthen provided to the media controller 616 for encrypting an audiorecording. In this regard, when the resource manager 610 forwards arecording request to the media control platform 608, the resourcemanager provides a database identifier of the IVR profile for the tenantfor whom the request is provided, and the media controller 616 retrievesthe public encryption key from the IVR profile for performing theencryption of the session key. According to one embodiment, if thetenant does not require encryption, the certificate is not configured inthe IVR profile. The media control platform does not encrypt an audiorecording if the certificate is not configured for a tenant.

In regards to the private key 674 received by the recording cryptoserver 629′, the server stores the private key in a key storageappliance 682. Security of the private key against unauthorized users isaided by encrypting the private key with the passphrase 682. In thisregard, the key storage appliance is a hardware encryption appliancethat encrypts the decryption key with the customer assigned passphrase.When a user 672 is ready to retrieve an audio recording, he or sheaccesses the playback graphical user interface 628′ (FIG. 12B) toidentify the appropriate recording (or portion of the recording) tolisten to. The user 672 further provides the passphrase 676 along with arequest for the identified recording, which is received by the webserver 614′ and forwarded to the recording crypto server 629′. Thepassphrase is used by the recording crypto server 629′ to access the keystorage appliance 682 and obtain the private key 674. According to oneembodiment, the passphrase is used to decrypt the encrypted private key.

According to one embodiment, the recording crypto server 629′ isconfigured to periodically rotate the public key 678 for a particulartenant. In this regard, the web server 614 may receive a session keyencrypted with a first public key from the media control platform 608.After such encryption, the recording crypto server 629′ may receive anew public key, along with a private key associated with the new publickey, from the tenant administrator. According to one embodiment, thereceipt of the public key causes retrieval and update of metadata of thecall recordings stored for the particular tenant. Specifically, thesession keys encrypted with the first public key are first decryptedbased on the old private key, and re-encrypted using the new public key.The newly encrypted session key is stored in the call record for thecall recordings maintained for the tenant.

According to one embodiment, the particular tenant administrator mayprovide a series of public keys to be stored in the tenant's IVRprofile. The recording crypto server 629′ may be configured to select,in a rotating fashion, which public key to be used for encrypting asession key. The rotating of the public key may be done on a period ornon-periodic basis.

According to one embodiment, the rotation of the public encryption keymay be done in batch for a plurality of call recordings. The updating ofthe public key without updating the session key avoids having tore-encrypt the audio data hosted in the mass storage device 624,avoiding costs associated with fetching the data from the mass storagedevice, re-encrypting the data, and then posting back to the massstorage device.

According to one embodiment, the playback user interface 628′ is invokedfor decryption and playback of encrypted audio files by an authorizeduser. In this regard, the user interface is invoked to select aparticular audio recording, and the URI of the selected audio recordingis passed to the web server 614 along with the passphrase 676. The webserver fetches the encrypted session key and the encrypted audio data,and packages the two components (along with the passcode) as, forexample, a single PKCS#7 component that is transmitted to the recordingcrypto server 629′. The recording crypto server 629′ is configured touse the passcode to obtain the private key, and use the private key todecrypt the content and return the decrypted audio to the user via, forexample HTTPS (Hypertext Transfer Protocol), or other securecommunication protocol.

VI. Call Event Tagging for Contact Center Call Recordings

As described above, the web server 614 receives call metadata for arecording segment for a call. According to one embodiment, a list ofcall events is submitted to the web server 614 as part of the callrecording metadata. Each call event may be associated with a timestampto allow navigation to the associated portion of the voice file duringplayback.

FIG. 17 is a diagram of a structure of call recording metadata providedto the web server 614 according to one embodiment of the invention. Themetadata includes “metadata” parameters 1100 and a “recordings” array1102 similar to the “metadata” parameters 1012 and “recordings” array1000 of FIG. 16.

The call metadata further includes an “events” structure 1104 with anarray of events 1106 a-1106 c. With respect to one exemplary event 1106a, each even includes a timestamp 1108 in which the event occurred foridentifying the portion of the voice recording associated with theevent. The event data in the example further identifies a DN 1110 of theagent involved in the event, and an event descriptor 1112 (e.g.indicating that a connection was made with the DN). Other events mayinclude, for example, a party joining the call, a party beingdisconnected to the call, and the like.

FIG. 18 is a conceptual layout diagram of a call record 1800 displayedby the playback graphical user interface 628′ according to oneembodiment of the invention. According to one embodiment, the callrecord 1800 is associated with a call conducted on a particular date andtime 1801, between a calling number 1803 and a called number 1804.According to one embodiment, information in the call record 1800 mightbe collapsed until receipt of a user command to expand the call record.

Information in the call record may include, for example, call event datacollected for a call. Such event data may be retrieved from the array ofevents 1106 a-1106 c stored in the events structure 1104 of FIG. 17.Each call event data may be identified with a time, a telephone numberwith which the event is associated, a name associated with the number,and a description of the event.

According to one embodiment, tags may be stored as part of the callrecording metadata as an event parameter, and displayed under a “calltags” field 1802 when the call record 1800 is displayed. Tags may besimilar to events in that tags are associated with a timestamp, atelephone number, a name associated with the number, and an eventdescription. The timestamp indicates the time in which the tag wasadded. The event description indicates that a tag was added. Thetelephone number and name identify the person adding the tag.

According to one embodiment of the invention, tags are generated andadded as part of the call recording metadata based on actions taken bythe contact center agent (as identified via his telephone number andname). The action may be an express command by the agent to add a tagduring a particular point of a conversation with a customer. Accordingto one embodiment, the agent device may provide various tag icons, menuitems, or the like, that the agent may select depending on a particularsubject that was discussed at a particular point in time, customersentiment (e.g. angry customer which generates an ANGRY_CUSTOMER tag),and any other information about the conversation. The tags may also begenerated automatically, for example, based on analysis of customer toneof voice, and the like. In another example, identification of aparticular department to which a call is transferred may cause theautomatic generating of a call tag. In yet another example, the invokingof particular forms (e.g. a new credit card application form) may causea tag associated with the form to be identified, associated with atimestamp in which the form was invoked, and added as metadata for thecall recording.

According to one embodiment, call recording metadata may be searched andfiltered for specific tags. For example, a supervisor may want to searchfor all calls tagged with a credit card tag. The display may beconfigured to display the call recordings having this attached data.

The tags may be used to navigate to a specific point in a call recordingthat may be of interest to the listener. For example, a supervisor mayfast forward to the interesting part of the recorded conversation, suchas, for example when the conversation switched from credit card tochecking account and agent tagged the call as PERSONAL_CHECKING.

According to one embodiment, selection of an event marked by aparticular call tag causes identification of the timestamp assigned tothe tag, and retrieval of the portion of the audio associated with thetimestamp for playing the audio. According to one embodiment, the audiostarts playing from the time indicated by the timestamp. The audio playsuntil the listener commands the playback to stop. The event may behighlighted as the recording plays.

VII. Call Recording Stitching for Multi-Site Contact Centers

According to one embodiment, a call may be transferred from one SIPserver which may be located in one location, to another SIP server whichmay be located in another location, such as, for example, when a call istransferred from one department to another. According to one embodiment,each SIP server 612 is configured to store call event data in a separateICON database 632. It is desirable to query the multiple ICON databasesin order to track call events associated with a particular call acrossmultiple SIP servers.

According to one embodiment, a call uuid generated by each SIP serverfor the segment of the call handled by the server is associated with aseparate call recording metadata. A separate audio file may also begenerated for each segment of the call. According to one embodiment, thecall recording metadata may be linked to other call recording metadatavia, for example, “next” and “previous” properties. The “next” propertymay be a link (e.g. URI) to a next call metadata record generated whenthe call is transferred by a current SIP server to a next SIP server,while the “previous” property may be a link to a previous call metadatarecord generated by a previous SIP server before the call is transferredto the current SIP server.

FIGS. 19 and 20 are diagrams of the structure of call recording metadatagenerated for different segments of a call according to one embodimentof the invention. In the illustrated example, call recording metadata2000 is generated for a first segment of a call which occurs before thecall is transferred to another SIP server. The transfer of the callcauses the generating of another call recording metadata 2002 for asecond segment of the call. According to one embodiment, each metadataincludes a call uuid identifying the segment of the call, and a link2010, 2012 to the call recording in the mass storage device 624.

The call recording metadata 2000 for the first segment of the callincludes a “next” link 2014 including the call uuid of the next segmentof the call. The “next” link thus allows the retrieval of the callrecording metadata 2002 generated for the second segment of the call.Similarly, the call recording metadata 2002 includes a “previous” link2016 including the call uuid of the previous segment of the call. The“previous” link thus allows the retrieval of the call recording metadata2000 generated for the first segment of the call.

According to one embodiment of the invention, although a single call maybe associated with multiple call uuids generated by different SIPservers, a single recording entry may be displayed by the playbackgraphical user interface 628′. The single recording entry may then beexpanded to display the various call events tracked for the call asdescribed with respect to FIG. 18. The playback graphical user interface628′ may be configured to follow the links in the call metadata recordsgenerated for each call segment for identifying call events associatedwith multiple call segments handled by different SIP servers. Byfollowing the links and identifying all call metadata records associatedwith the single call, the various call recording files may be identifiedand stitched together for playback in a seamless manner.

Each of the various servers in the afore-described figures may be aprocess or thread running on one or more processors in a single ormultiple computing devices, executing computer program instructions andinteracting with other system components for performing the variousfunctionalities described herein. The computer program instructions arestored in a memory which may be implemented in a computing device usinga standard memory device, such as, for example, a random access memory(RAM). The computer program instructions may also be stored in othernon-transitory computer readable media such as, for example, a CD-ROM,flash drive, or the like. Also, a person of skill in the art shouldrecognize that a computing device may be implemented via firmware (e.g.an application-specific integrated circuit), hardware, or a combinationof software, firmware, and hardware. A person of skill in the art shouldalso recognize that the functionality of various computing devices maybe combined or integrated into a single computing device, or thefunctionality of a particular computing device may be distributed acrossone or more other computing devices without departing from the scope ofthe exemplary embodiments of the present invention. A server may be asoftware module, which may also simply be referred to as a module. Theset of modules in the contact center may include servers, and othermodules.

VIII. Network Recording and Speech Analytics

According to one embodiment, a network recording and speech analyticssystem and method are provided for intercepting and recording callsbetween two entities that may or may not be part of a contact center.The network recording system is configured to analyze the recordings ona real-time (as the call is occurring) or non-real time (after the callis complete) basis. According to one embodiment, the software andhardware resources needed to record, store, and analyze the callrecordings are hosted in a remote computing environment similar to theremote computing environment of the previous embodiments. The callrecordings may be encrypted prior to storing as discussed in regards tothe previous embodiments.

According to one embodiment, the network recording system is configuredto capture contact center agent data such as an agent's name, skilllevels, queue information, agent location, and call data (also referredto as attached data) of calls handled by the agent. The call data mayinclude the called telephone number, customer account information (ifany), customer history information, and/or any other data maintained ina customer or call database. The captured data is metadata that may thenbe stored in association with the call recording.

According to one embodiment, calls are processed from the PSTN andforwarded to a remote computing environment where the calls are recordedas they pass-through the remote computing environment, and directed backto a called destination such as, for example, a contact center. Therecording of the call is stored in a storage device such as, forexample, the mass storage device 624 of FIGS. 12A-12B. Metadataassociated with the call (e.g. agent data and call data) is alsocaptured and stored in association with the call recording. Other typesof metadata described in connection with the previous embodiments mayalso be captured and stored.

According to some embodiments, the following metadata may be capturedand stored:

1) All metadata obtained via a call server (e.g. SIP server or othercall controller coupled to a telephony switch) on a business premisebased on the caller's telephone number. In this regard, the networkrecording system is configured to monitor the call routing behind thetelephony switch until the call is disconnected. This enables callmonitoring outside of the contact center environment, such as, forexample, a financial trading environment.

2) Key words or phrases uttered during the call and captured using anyspeech recognition system and method conventional in the art. Theutterance may be converted into text and stored as metadata for beinganalyzed for potential actionable event using, for example a system andmethod as described in U.S. application Ser. No. 13/893,036, filed onMay 13, 2013, entitled Actionable Workflow Based on InteractionAnalytics Analysis, the content of which is incorporated herein byreference.

3) Multi-channel metadata captured from other media channels before thecall arrives at the remote computing environment. According to oneembodiment, the local premise of the business may host a conversationserver for capturing data from non-voice interactions with the customerusing other media channels, such as, for example, web interactions,e-mails, social media interactions, and/or interactions using mobiledevices, and leverage this data as part of the metadata and speech/textanalysis process. The multi-channel metadata may also be included asmetadata for historic record.

4) Mobile application and device information in embodiments where thecaller invokes a mobile application via a mobile device to initiate thecall. Such information may include, for example, a caller's location,device information, security tokens, application data, and the like.

5) Service control point (SCP) information such as subscriberinformation of the caller or callee captured at various routing pointsin the carrier network.

6) Information about the employee engaged in the call such as, forexample, employee name, ID, location, computer location, skill level,title, and other data stored in a employee database.

7) Interactions between the caller and an automated response system suchas, for example, an IVR.

8) Mobile device application or system information obtained over a datachannel separate from the voice call. Such information may include, forexample, location of the mobile device, device ID, user ID,authentication tokens, and the like. For example, a customer callingfrom an iPhone application may provide device, authentication, andapplication details out-of-band of the voice call, and the data may bestored as metadata in association with the call recording.

According to one embodiment, both the call recording and metadata, oncecaptured, may be accessed through a web browser and secured via localencryption and userid/passwords. The web browser may be provided by theweb server 614, 614′ of FIGS. 12A, 12B. The web browser may also providea playback user interface such as the playback user interface 628, 628′of FIGS. 12A, 12B.

Once the calls and metadata are stored, speech analysis tools may beinvoked to analyze the stored speech data for automatic speechrecognition and text-to-speech functionality. The speech analysis toolsmay be provided by speech servers similar to speech servers 54 of FIG.2, and speech servers 631 of FIG. 12B. Speech analysis may be performedto determine, along with the associated metadata, agent/employeeperformance, quality management, compliance (e.g. compliance in thefinancial industry with laws such as those provided in the Dodd-FrankAct), fraud detection, and the like. Details of a system and method forautomatically classifying a communication based on speech analytics isdisclosed in U.S. Pat. No. 7,487,094 entitled “System and Method of CallClassification with Context Modeling Based on Composite Words,” thecontent of which is incorporated herein by reference.

Metadata and call recordings may be stored in the remote computingenvironment and accessed anytime by an authorized user over, forexample, the Internet. Authorized users may decide when or how often topurge the call recordings and associated metadata. Of course, all or aportion of the call recordings and associated metadata may also bestored in a local computing environment.

According to one embodiment, software and hardware needed for callrecording and analysis in the remote computing environment may bemanaged and maintained by a call recording operator/provider. Theoperator may offer a call recording solution to a customer through a PPU(Pay for Use) business model. The operator may also allow speechanalytics to be turned on or off as desired by the customer. Forexample, the customer may want to turn on speech analytics and“self-discover” customer trends without the need for special programmingor implementation. The customer may also decide the percentage of callsthat are to be analyzed based on the number of calls being redirected tothe remote computing environment from the PSTN carrier. The customer mayalso decide the period of time in which recordings are maintained andhow often the recording and related metadata is purged. Such options maybe configured by the customer by accessing a configuration GUI providedby the call recording operator/provider. By offering call recording andanalytics from a remote computing environment, the customer need notmake changes to its telecommunications routing nor provision settings onits telephony switch. The customer's existing configuration andinfrastructure may be leveraged to reduce efforts in providing to thecustomer the ability to record and analyze calls to the business.

According to one embodiment, in embodiments where the businesspurchasing the call recording solution is in the financial industry,certain measures may be taken to ensure that an employee of the businessutilizes company issued computers and resources during a call between acustomer and the employee. In this regard, a cryptographic key may beprovided to the calling customer for being presented during the call.The cryptographic key may allow the employee to be authenticated to thecompany-issued computer to access the caller's information and conductother transactions using the computer. For example, a trader thatreceives a call from a client would have to enter the key as part of alogin sequence for the customer in order to access the customer'srecord. The company may determine when and where the application can beaccessed. The key helps ensure that the customer does not bypasscorporate security and logging controls.

FIG. 24 is a schematic block diagram of a network recording and speechanalytics system according to one embodiment of the invention. Thesystem includes software and hardware resources (collectively referredto as recording system) in a remote computing environment 1516 similarto the software and hardware resources in the remote computingenvironment 600 of FIGS. 12A and 12B. Such resources include, withoutlimitation, an edge device 1506, speech server 1508, media server 1510,and metadata server 1512. The edge device 1506 may be similar to edgedevice 604 of FIGS. 12A and 12B. The media server 1510 may be similar tomedia server 608 of FIGS. 12A and 12B. The speech server 1508 may besimilar to the speech server 631 of FIG. 12B. The metadata server 1512may be similar to the recording processor 615 of FIG. 12B.

According to one embodiment, a call from a customer is received at aservice control point, service management system, or intelligentperipheral (collectively referred to as a routing point) in, forexample, a carrier PSTN network 1500. The routing point is configuredwith logic (e.g. software, hardware, and/or firmware) to route the callto a destination based on the called number. According to oneembodiment, the routing point determines prior to routing the callwhether the call is to be forwarded to the recording system in theremote computing environment 1516 over a wide area network forrecording. The determination may be based, for example, on the callednumber, a calling number, and/or call volume. For example, the routingpoint may be configured to identify all (or a percentage) of callsoriginating from a particular number or directed to a particular number,retrieve rules configured for the matching number(s) to determinewhether the call is to be recorded, and if so, forward the call to therecording system. A matching rule may provide, for example, an IPaddress of the edge device 1506 in the remote computing environment towhich to forward the call for recording. The rule may further indicateconditions that are to be satisfied before forwarding the call to theedge device. A call leg 1518 is established via, for example, SIPsignaling, between the routing point and the edge device 1506 inresponse to determining that the call is to be recorded. If the call isnot to be recorded, a call leg 1520 is established via, for example SIPsignaling, between the routing point and a destination device located atthe enterprise premise 1502.

According to one embodiment, the edge device and/or media server 1510 isconfigured to bridge the incoming call leg 1518 with an outgoing callleg 1522 for allowing exchange of RTP traffic between the callingcustomer and an enterprise agent 1514. The agent may be any employee ofthe enterprise and not necessary a contact center agent. Routing rulesmay not be necessary to direct the call to the employee.

The edge device 1506 may communicate with the media server 1510 torecord (e.g. replicate and store) media exchanged during the call. Therecording may capture all prompts (e.g. IVR, call queue announcements,etc.) as well as customer and agent conversations. Call recordingcontinues even if calls are transferred between agents or conferencetogether with other agents and/or supervisors, and stops upon detectingthe end of the call. According to one embodiment, the call recording isstored in a mass storage device such as, for example, the mass storagedevice 624 of FIGS. 12A, 12B.

According to one embodiment, the logic that determines whether a callshould be recorded or not (e.g. based on call volume) is located at theedge device 1506 or media server 1510. All or a portion of that logicmay also be located at a routing point in the PSTN network.

According to the embodiment where the logic is located at the edgedevice/media server, the routing point at the carrier PSTN networkforwards all calls that match a particular number to the recordingsystem, and it is the recording system that determines ultimately that,based on specific rules configured by a customer subscribing to therecording service, that the call is to be recorded. A separateaccounting server (not shown) may also be invoked to keep track ofrecordings and analytics performed for the particular customer forappropriate billing of the customer.

The metadata server 1512 is configured to capture metadata related tothe call for storing in a mass storage device in association with therecorded media. Such metadata may be obtained, for example, viainformation provided by a switch 1504 at the enterprise premise 1502.The information may include ACD statistics, queue statistics, agentavailability, and the like. The captured metadata is stored in a localfile system or remote database such as, for example, the call database634 of FIGS. 12A, 12B.

The speech server 1508 may be invoked to analyze the media for key termsand/or phrases. Captured metadata may also be considered during thisanalysis. The recorded media, metadata, and/or analysis data (e.g. inthe form of reports) may be provided to a requesting, authorized userupon receipt of a command. According to one embodiment, a web interfacesuch as, for example, the playback user interface 628′ of FIG. 12B isprovided for allowing access by authorized users to reporting and callrecording playback.

Each of the various servers, controllers, switches, and/or gateways inthe afore-described figures may be a process or thread, running on oneor more processors, in one or more computing devices, executing computerprogram instructions and interacting with other system components forperforming the various functionalities described herein. The computerprogram instructions are stored in a memory which may be implemented ina computing device using a standard memory device, such as, for example,a random access memory (RAM). The computer program instructions may alsobe stored in other non-transitory computer readable media such as, forexample, a CD-ROM, flash drive, or the like. Also, a person of skill inthe art should recognize that a computing device may be implemented viafirmware (e.g. an application-specific integrated circuit), hardware, ora combination of software, firmware, and hardware. A person of skill inthe art should also recognize that the functionality of variouscomputing devices may be combined or integrated into a single computingdevice, or the functionality of a particular computing device may bedistributed across one or more other computing devices without departingfrom the scope of the exemplary embodiments of the present invention. Aserver may be a software module, which may also simply be referred to asa module. The set of modules in the contact center may include servers,and other modules.

It is the Applicant's intention to cover by claims all such uses of theinvention and those changes and modifications which could be made to theembodiments of the invention herein chosen for the purpose of disclosurewithout departing from the spirit and scope of the invention. Thus, thepresent embodiments of the invention should be considered in allrespects as illustrative and not restrictive, the scope of the inventionto be indicated by claims and their equivalents rather than theforegoing description.

The invention claimed is:
 1. A method for network recording comprising:receiving by a recording system having a processor, memory, and massstorage device, media exchanged between first and second communicationdevices during a telephony call, the media being received by therecording system over a wide area network; bridging by the recordingsystem a media path between the first and second communication devices;replicating by the recording system media exchanged in the media pathfor storing the replicated media in the mass storage device; capturingby the recording system metadata associated with the call; storing thecaptured metadata in association with the stored media; and providingthe stored media and metadata to a requesting device over the wide areanetwork, wherein a routing point coupled to the recording system overthe wide area network is configured to receive a request for thetelephony call from the first communication device and determine whetherthe telephony call is to be recorded, and in response to determiningthat the telephony call is to be recorded the routing point isconfigured to establish a first call path between the routing point andthe recording system, and in response to determining that the telephonycall is not to be recorded, the routing point is configured to establisha second call path between the routing point and the secondcommunication device without establishing the first call path.
 2. Themethod of claim 1, wherein the metadata includes data from interactionswith a party associated with the first communication device, wherein theinteractions are non-voice interactions.
 3. The method of claim 1,wherein the metadata includes information on a party associated with thefirst or second communication device.
 4. The method of claim 1, whereinthe metadata includes information on the first or second communicationdevice.
 5. The method of claim 1, wherein the metadata includes a worduttered during the call.
 6. The method of claim 1 further comprising:analyzing by the recording system the stored media for detecting atleast one of a key word or phrase; and triggering an action in responseto the analysis.
 7. The method of claim 1, wherein the recording systemis hosted in a computing environment geographically remote from thefirst and second communication devices.
 8. A method for networkrecording comprising: receiving by a recording system having aprocessor, memory, and mass storage device, media exchanged betweenfirst and second communication devices during a telephony call, themedia being received by the recording system over a wide area network;bridging by the recording system a media path between the first andsecond communication devices; replicating by the recording system mediaexchanged in the media path for storing the replicated media in the massstorage device; capturing by the recording system metadata associatedwith the call; storing the captured metadata in association with thestored media; providing the stored media and metadata to a requestingdevice over the wide area network; encrypting by the recording systemthe replicated media via a first cryptographic key for storing theencrypted media in the storage device; and encrypting by the recordingsystem the first cryptographic key via a second cryptographic key forstoring the encrypted first cryptographic key as metadata for theencrypted media.
 9. A method for network recording comprising: receivingby a recording system having a media controller, memory, and massstorage device, media exchanged between first and second communicationdevices during a telephony call, the media being received by therecording system over a wide area network; bridging by the recordingsystem a media path between the first and second communication devices;replicating by the recording system media exchanged in the media pathfor storing the replicated media in the mass storage device; capturingby the recording system metadata associated with the call; storing thecaptured metadata in association with the stored media; providing thestored media and metadata to a requesting device over the wide areanetwork; establishing by a call controller in a first geographic region,the telephony call between the first and second communication devices;identifying by the call controller a second geographic locationassociated with a resource involved in the telephony call; andidentifying by the call controller the media controller based onlocation of the media controller in the identified second geographiclocation.
 10. A method for network recording comprising: receiving by arecording system having a media controller, memory, and mass storagedevice, media exchanged between first and second communication devicesduring a telephony call, the media being received by the recordingsystem over a wide area network; bridging by the recording system amedia path between the first and second communication devices;replicating by the recording system media exchanged in the media pathfor storing the replicated media in the mass storage device; capturingby the recording system metadata associated with the call; storing thecaptured metadata in association with the stored media; providing thestored media and metadata to a requesting device over the wide areanetwork; identifying by a call controller the first media controllercurrently assigned to the telephony call; detecting by the callcontroller failure of the first media controller during the telephonycall, wherein the failure of the media controller tears down the mediapath; in response to detecting the failure, bridging by the callcontroller a second media path between the first and secondcommunication devices until a second media controller is identified; andin response to the second media controller being identified, signalingby the call controller the second media controller to bridge and recordmedia exchanged during the telephony call.
 11. A method for networkrecording comprising: receiving by a recording system having a mediacontroller, memory, and mass storage device, media exchanged betweenfirst and second communication devices during a telephony call, themedia being received by the recording system over a wide area network;bridging by the recording system a media path between the first andsecond communication devices; replicating by the recording system mediaexchanged in the media path for storing the replicated media in the massstorage device; capturing by the recording system metadata associatedwith the call, wherein the metadata includes a link to a recording ofmedia exchanged during the telephony call; storing the captured metadatain association with the stored media; providing the stored media andmetadata to a requesting device over the wide area network; receiving bythe recording system a call event associated with the telephony call,the call event including a timestamp of when the event occurred duringthe telephony call; storing by the recording system the call event in adatabase record; retrieving by the recording system the database recordfor displaying the call event on a display device; receiving by therecording system a user command identifying the call event in responseto the display on the display device; and retrieving by the recordingsystem a portion of the recording associated with the call event inresponse to the user command for providing an audible rendering of theretrieved portion of the recording.
 12. A system for recording media fora contact center comprising: a first processor; a first memory, whereinthe first memory has stored thereon instructions that, when executed bythe first processor, cause the first processor to: receive mediaexchanged between first and second communication devices during atelephony call, the media being received by the recording system over awide area network; bridge a media path between the first and secondcommunication devices; and replicate media exchanged in the media pathfor storing the replicated media in the mass storage device; and asecond processor coupled to the first processor; and a second memory,wherein the second memory has stored thereon instructions that, whenexecuted by the second processor, cause the second processor to: capturemetadata associated with the call; and store the captured metadata inassociation with the stored media, wherein a routing point coupled tothe first processor over the wide area network is configured to receivea request for the telephony call from the first communication device anddetermine whether the telephony call is to be recorded, and in responseto determining that the telephony call is to be recorded, the routingpoint is configured to establish a first call path between the routingpoint and the first processor, and in response to determining that thetelephony call is not to be recorded, the routing point is configured toestablish a second call path between the routing point and the secondcommunication device without establishing the first call path.
 13. Thesystem of claim 12, wherein the metadata includes data from interactionswith a customer of the first communication device, wherein theinteractions are conducted over media channels other than a telephonychannel.
 14. The system of claim 12, wherein the metadata includesinformation on a user of the first or second communication device. 15.The system of claim 12, wherein the metadata includes information on thefirst or second communication device.
 16. The system of claim 12,wherein the metadata includes a word uttered during the call.
 17. Thesystem of claim 12 further comprising: a third processor; and a thirdmemory wherein the third memory has stored thereon instructions that,when executed by the third processor, cause the third processor to:analyze the stored media for detecting at least one of a key word orphrase.
 18. A system for recording media for a contact centercomprising: a first processor; a first memory, wherein the first memoryhas stored thereon instructions that, when executed by the firstprocessor, cause the first processor to: receive media exchanged betweenfirst and second communication devices during a telephony call, themedia being received by the recording system over a wide area network;bridge a media path between the first and second communication devices;and replicate media exchanged in the media path for storing thereplicated media in the mass storage device; encrypt the replicatedmedia via a first cryptographic key for storing the encrypted media inthe storage device; and encrypt he first cryptographic key via a secondcryptographic key for storing the encrypted first cryptographic key asmetadata for the encrypted media; and a second processor coupled to thefirst processor; and a second memory, wherein the second memory hasstored thereon instructions that, when executed by the second processor,cause the second processor to: capture metadata associated with thecall; and store the captured metadata in association with the storedmedia.
 19. A system for recording media for a contact center comprising:a media controller; a first memory, wherein the first memory has storedthereon instructions that, when executed by the media controller, causethe media controller to: receive media exchanged between first andsecond communication devices during a telephony call, the media beingreceived by the recording system over a wide area network; bridge amedia path between the first and second communication devices; andreplicate media exchanged in the media path for storing the replicatedmedia in the mass storage device; a processor coupled to the mediacontroller; a second memory, wherein the second memory has storedthereon instructions that, when executed by the processor, cause theprocessor to: capture metadata associated with the call; and store thecaptured metadata in association with the stored media; and a callcontroller in a first geographic region, the call controller beingconfigured to: establish the telephony call between the first and secondcommunication devices; identify a second geographic location associatedwith a resource involved in the telephony call; and identify the mediacontroller based on location of the media controller in the identifiedsecond geographic location.
 20. A system for recording media for acontact center comprising: a media controller; a first memory, whereinthe first memory has stored thereon instructions that, when executed bythe media controller, cause the media controller to: receive mediaexchanged between first and second communication devices during atelephony call, the media being received by the recording system over awide area network; bridge a media path between the first and secondcommunication devices; and replicate media exchanged in the media pathfor storing the replicated media in the mass storage device; a processorcoupled to the media controller; a second memory, wherein the secondmemory has stored thereon instructions that, when executed by theprocessor, cause the processor to: capture metadata associated with thecall; and store the captured metadata in association with the storedmedia; and a call controller configured to: identify the first mediacontroller currently assigned to the telephony call; detect failure ofthe first media controller during the telephony call, wherein thefailure of the media controller tears down the media path; in responseto detecting the failure, bridge a second media path between the firstand second communication devices until a second media controller isidentified; and in response to the second media controller beingidentified, signal the second media controller to bridge and recordmedia exchanged during the telephony call.
 21. A system for recordingmedia for a contact center comprising: a media controller; a firstmemory, wherein the first memory has stored thereon instructions that,when executed by the media controller, cause the media controller to:receive media exchanged between first and second communication devicesduring a telephony call the media being received by the recording systemover a wide area network; bridge a media path between the first andsecond communication devices; and replicate media exchanged in the mediapath for storing the replicated media in the mass storage device; aprocessor coupled to the media controller; a second memory, wherein thesecond memory has stored thereon instructions that, when executed by theprocessor, cause the processor to: capture metadata associated with thecall, wherein the metadata includes a link to a recording of mediaexchanged during the telephony call; and store the captured metadata inassociation with the stored media; and receive a call event associatedwith the telephony call, the call event including a timestamp of whenthe event occurred during the telephony call; and store the call eventin a database record, wherein the system further includes a playbackdevice configured to: retrieve the database record for displaying thecall event on a display device; receive a user command identifying thecall event in response to the display on the display device; andretrieve a portion of the recording associated with the call event inresponse to the user command for providing an audible rendering of theretrieved portion of the recording.